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BAKER BOTTS L.L.P. 
30 ROCKEFELLER PLAZA 
NEW YORK, NEW YORK 10112 



TO ALL WHOM IT MAY CONCERN: 

Be it known that We, Lloyd G. Mitchell, Mariano A. Garcia-Blanco, 
Madaiah Puttaraju and S. Gary Mansfield, citi2ens of the United States, residing in the 
United States, City of Durham, State of North Carolina, whose post office addresses are 
4500 Highgate Drive, Durham, North Carolina 2771 3, 12 Sanderling Court, Durham, 
North Carolma 27713, 416 Tall Oaks Drive, Durham. North Carolina 27713 and 1005 
Prologue Road, Durham, North Carolina 27712, respectively, have invented an 
improvement in 

"Methods and Compositions for Use in Spliceosome Mediated RNA rraAis-splicing" 

of which the following is a 



SPECIFICATION 



The present application is a continuation-in-part of pending application 
serial number 09/158,863 fded September 23, 1998 which is a continuation-in-part of 
serial number 09/133,717 filed on August 13, 1998 which is a continuation-in-part of 
serial number 09/087,233 filed on May 28, 1998, which is a continuation-in-part of 
pending application serial number 08/766,354 filed on December 13, 1996, which clauns 
benefit to provisional application number 60/008,317 filed on December 15, 1995. 

The present invention was made with government support under Grant 
Nos. SBIR R43DK56526-01 and SBIR R44DK5 6526-02. The government has certain 
rights in the invention. 

1. fNTRODUCTION 
The present invention provides methods and compositions for generating 
novel nucleic acid molecules through targeted spliceosomal /raw^-splicing. The 
compositions of the invention include pre-/r<ms-splicing molecules (PTMs) designed to 
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interact with a natural target precursor messenger RNA molecule (target pre-mRNA) and 
mediate a /raw^-splicing reaction resulting in the generation of a novel chimeric RNA 
molecule (chimeric RNA). The PTMs of the invention are genetically engineered so as to 
result in the production of a novel chimeric RNA which may itself perform a function, 

5 such as mhibiting the translation of the RNA, or that encodes a protein that complements 
a defective or inactive protein in a cell, or encodes a toxin which kills specific cells. 
Generally, the target pre-mRNA is chosen as a target because it is expressed within a 
specific cell type thus providing a means for targeting expression of the novel chimeric 
RNA to a selected cell type. The invention further relates to PTMs that have been 

10 genetically engineered for the identification of exon/intron boundaries of pre-mRNA 

molecules using an exon tagging method. In addition, PTMs can be designed to resuU in 
the production of chimeric RNA encoding for peptide affinity purification tags which can 
be used to purify and identify proteins expressed in a specific cell type. The methods of 
the invention encompass contacting the PTMs of the invention with a target pre-mRNA 

1 5 under conditions in which a portion of the PTM is trans-spliccd to a portion of the target 
pre-mRNA to form a novel chimeric RNA molecule. The methods and compositions of 
the invention can be used m cellular gene regulation, gene repair and suicide gene therapy 
for treatment of proliferative disorders such as cancer or treatment of genetic, 
autoimmune or infectious diseases. In addition, the methods and compositions of the 

20 invention can be used to generate novel nucleic acid molecules in plants through targeted 
splicesomal /r(m?-splicing. For example, targeted /raw-splicing may be used to regulate 
gene expression in plants for treatment of plants diseases, engineering of disease resistant 
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plants or expression of desirable genes in plants. The methods and compositions of the 
invention can also be used to map intron-exon boundaries and to identify novel proteins 
expressed in any given cell. 

2. BACKGROUND OF THE INVENTION 

5 DNA sequences in the chromosome are transcribed into pre-mRNAs 

which contain coding regions (exons) and generally also contain intervening non-coding 
regions (mtrons). Introns are removed from pre-mRNAs in a precise process called 
splicing (Chow etal, 1977, Cell 12:1-8; and Berget, S.M. et al, 1977, Proc. Natl. Acad. 
Sci. USA 74:3171-3175). Splicing takes place as a coordinated interaction of several 

10 small nuclear ribonucleoprotein particles (snRNP's) and many protein factors that 

assemble to form an enzymatic complex known as the spliceosome (Moore et al ,1993, in 
The RNA World, R.F. Gestland and J.F. Atkins eds, (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y.); Kramer, 1996, Annu. Rev. Biochem., 65:367-404; 
Staley and Guthrie, 1998, Cell 92:315-326). 

15 Pre-mRNA splicing proceeds by a two-step mechanism. In the first step, 

the 5' splice site is cleaved, resulting in a "free" 5' exon and a lariat intermediate (Moore, 
M.J. and P.A. Sharp, 1993, Nature 365:364-368). In the second step, the 5* exon is 
ligated to the 3' exon with release of the intron as the lariat product. These steps are 
catalyzed in a complex of small nuclear ribonucleoproteins and proteins called the 

20 spliceosome. 
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The splicing reaction sites are defined by consensus sequences around the 
5* and 3' splice sites. The 5' splice site consensus sequence is AG/GURAGU (where 
A=adenosine, U = uracil, G = guanine, C = cytosine, R = purine and / = the splice site). 
The 3* splice region consists of three separate sequence elements: the branch point or 

5 branch site, a polypyrimidine tract and the 3' splice consensus sequence (YAG). These 
elements loosely define a 3* splice region, which may encompass 100 nucleotides of the 
intron upstream of the 3* splice site. The branch point consensus sequence in mammals is 
YNYURAC (where N = any nucleotide, Y= pyrimidme). The underlined A is the site of 
branch formation (the BPA = branch point adenosine). The 3* splice consensus sequence 

1 0 is YAG/G. Between the branch point and the splice site there is usually found a 

polypyrimidine tract, which is important in mammalian systems for efficient branch point 
utilization and 3' splice site recognition (Roscigno, R., F. et al, 1993, J. Biol. Chem. 
268:1 1222-1 1229). The first YAG trinucleotide downstream fi-om the branch point and 
polypyrimidine tract is the most commonly used 3* splice site (Smith, C.W, et al, 1989, 

15 Nature 342:243-247). 

In most cases, the splicing reaction occurs within the same pre-mRNA 
molecule, which is termed c/^-splicing. Splicing between two independently transcribed 
pre-mRNAs is termed rrara-splicing. Tran^-splicing was first discovered in 
trypanosomes (Sutton & Boothroyd, 1986, Cell 47:527; Murphy et al, 1986, Cell 

20 47:5 1 7) and subsequently in nematodes (Krause & Hirsh, 1987, Cell 49:753); flatwonns 
(Rajkovic etaU 1990, Proc. Nat'l. Acad. Sci. USA, 87:8879; Davis etal, 1995, J. Biol. 
Chem. 270:21813) andinplantmitochondria(Maleke/ a/., 1997,Proc.Nat'l. Acad. Sci. 
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USA 94:553). In the parasite Trypanosoma brucei, all mRNAs acquire a splice leader 
(SL) RNA at their 5' termini by /ran^-splicing. A 5' leader sequence is also trans-spliced 
onto some genes in Caenorhabditis elegans. This mechanism is appropriate for adding a 
single common sequence to many different transcripts. 

The mechanism of rraw^-splicing, which is nearly identical to that of 
conventional cij-splicing, proceeds via two phosphoiyl transfer reactions. The first 
causes the formation of a 2'-5' phosphodiester bond producing a ' Y* shaped branched 
intermediate, equivalent to the lariat intermediate in c/^-splicing. The second reaction, 
exon ligation, proceeds as in conventional c/^-splicing. In addition, sequences at the 3' 
splice site and some of the snRNPs which catalyze the trans-splicmg reaction, closely 
resemble their counterparts involved in cr^-splicing. 

Tramr-splicing may also refer to a different process, where an intron of one 
pre-mRNA mteracts with an intron of a second pre-mRNA, enhancing the recombination 
of splice sites between two conventional pre-mRNAs. This type of /ran^-splicing was 
postulated to account for transcripts encoding a human immunoglobulin variable region 
sequence linked to the endogenous constant region in a transgenic mouse (Shimizu et 
fl/.,1989, Proc. Nat'l. Acad. Sci. USA 86:8020). In addition, fran^-splicing of c-myb pre- 
RNA has been demonstrated (Vellard, M, et al, Proc. Natl. Acad. Sci., 1992 89:251 1- 
2515) and more recently, RNA transcripts firom cloned SV40 trans-spliced to each other 
were detected in cultured cells and nuclear extracts (Eul et al, 1995, EMBO. J. 14:3226). 
However, naturally occurring /r^z/is-splicing of mammalian pre-mRNAs is thought to be 
an exceedingly rare event 
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In vitro traw^-splicing has been used as a model system to examine the 
mechanism of splicing by several groups (Konarska & Sharp, 1985, Cell 46:165-171 
Solnick, 1985, Cell 42:157; Chiara & Reed, 1995, Nature 375:510; Pasman and Garcia- 
Blanco, 1996, Nucleic Acids Res. 24:1638). Reasonably efficient /rcms-splicing (30% of 
cz5-spliced analog) was achieved between RNAs capable of base pairing to each other, 
splicing of RNAs not tethered by base pairing was further diminished by a factor of 10. 
Other in vitro /raw5-splicing reactions not requiring obvious RNA-RNA interactions 
among the substrates were observed by Chiara & Reed (1995, Nature 375:510), Bruzik 
J.P. & Maniatis, T. (1992, Nature 360:692) and Bruzik J.P. and Maniatis, T., (1995, Proc. 
Nafl. Acad. Sci. USA 92:7056-7059). These reactions occur at relatively low frequencies 
and require specialized elements, such as a downstream 5* splice site or exonic splicing 
enhancers. 

In addition to splicing mechanisms involving the binding of multiple 
proteins to the precursor mRNA which then act to correctly cut and join RNA, a third 
mechanism involves cutting and joining of the RNA by the intron itself, by what are 
termed catalytic RNA molecules or ribozymes. The cleavage activity of ribozymes has 
been targeted to specific RNAs by engineering a discrete "hybridization" region into the 
ribozyme. Upon hybridization to the target RNA, the catalytic region of the ribozyme 
cleaves the target. It has been suggested that such ribozyme activity would be useful for 
the inactivation or cleavage of target RNA in vivo, such as for the treatment of human 
diseases characterized by production of foreign of aberrant RNA. The use of antisense 
RNA has also been proposed as an alternative mechanism for targeting and destruction of 
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specific RNAs. In such instances small RNA molecules are designed to hybridize to the 
target RNA and by binding to the target RNA prevent translation of the target RNA or 
cause destruction of the RNA through activation of nucleases. 

Until recently, the practical application of targeted trans-splicmg to 

5 modify specific target genes has been limited to group I ribozyme-based mechanisms. 
Using the Tetrahymena group 1 ribozyme, targeted trans-spWcmg was demonstrated in E, 
coll coll (Sullenger B.A. and Cech, T.R., 1994, Nature 341:619-622) , in mouse 
fibroblasts (Jones, J.T. et al., 1996, Nature Medicine 2:643-648), human fibroblasts 
(Phylacton, LA. et al. Nature Genetics 1 8:378-381) and human erythroid precursors fLan 

10 et al., 1998, Science 280:1593-1596), While many applications of targeted RNA trans- 
splicing driven by modified group I ribozymes have been explored, targeted trans- 
splicing mediated by native mammalian splicing machinery, z.e., spliceosomes, has not 
been previously reported. 



3. SUMMARY OF THE INVENTION 

15 The present invention relates to compositions and methods for generating 

novel nucleic acid molecules through spliceosome-mediated targeted fron^-splicing. The 
compositions of the invention include pre-Zr^/z^-splicing molecules (hereinafter referred 
to as "PTMs") designed to mteract v^th a natural target pre-mRNA molecule (hereinafter 
referred to as "pre-mRNA") and mediate a spliceosomal /rara-splicmg reaction resulting 

20 in the generation of a novel chimeric RNA molecule (hereinafter referred to as "chimeric 
RNA"). The methods of the invention encompass contacting the PTMs of the invention 
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with a natural target pre-mRNA under conditions in which a portion of the PTM is 
spliced to the natural pre-mRNA to form a novel chimeric RNA . The PTMs of the 
invention are genetically engineered so that the novel chimeric RNA resulting from the 
trans-spUcmg reaction may itself perform a function such as inhibiting the translation of 

5 RNA, or alternatively, the chimeric RNA may encode a protein tiiat complements a 
defective or inactive protein in the cell, or encodes a toxin which kills the specific cells. 
Generally, the target pre-mRNA is chosen because it is expressed within a specific cell 
type thereby providing a means for tai^eting expression of the novel chimeric RNA to a 
selected cell type. The target cells may include, but are not limited to those infected with 

10 viral or other infectious agents, benign or malignant neoplasms, or components of the 

immime system which are involved in autoimmune disease or tissue rejection. The PTMs 
of the invention may also be used to correct genetic mutations foimd to be associated with 
genetic diseases. In particular, double-rrofis-splicing reactions can be used to replace 
internal exons. The PTMs of the invention can also be genetically engineered to tag exon 

1 5 sequences in a mRNA molecule as a method for identifying intron/exon boundaries in 
target pre-mRNA. The invention further relates to the use of PTM molecules that are 
genetically engineered to encode a peptide affinity purification tag for use in the 
purification and identification of proteins expressed in a specific cell type. The methods 
and compositions of the invention can be used in gene regulation, gene repair and 

20 targeted cell death. Such methods and compositions can be used for the treatment of 

various diseases including, but not limited to, genetic, infectious or autoimmune diseases 
and proliferative disorders such as cancer and to regulate gene expression in plants. 
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4. BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A. Model of Pre-rra/i^-splicing RNA. 

Figure IB. Model PTM constructs and targeted fran^-splicing strategy. 
Schematic representation of the first generation PTMs (PTM+Sp and PTM-Sp). BD, 
binding domain; NBD, non-binding domain; BP, branch point; PPT, pyrimidine tract; ss, 
splice site and DT-A, diphtheria toxin subunit A. Unique restriction sites within the 
PTMS are indicated by smgle letters: E; EcoRI; X, Xhol; K, Kpnl; P, Pstl; A, Accl; B, 
BamHIandH; HindHL 

Figure IC. Schematic drawing showing the binding of PTM+Sp via 
conventional Watson Crick base pairing to the pHCG6 target pre-mRNA and the 
proposed cis~ and frflm-splicing mechanism. 

Figure 2A. In vitro /raw5"Splicing efficiency of various PTM constructs 
into pHCG6 target. A targeted binding domain and active splice sites correlate with PTM 
/r^ms-splicing activity. Full length targeted (pcPTM+Sp), non-targeted (PTM-Sp) and the 
splice mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] PTM RNAs were added to splicing 
reactions containing |5HCG6 target pre-mRNA. The products were RT-PCR amplified 
using primers pHCG-F (specific for target PHCG6 exon 1) and DT-5R (complementary 
to DT-A) and analyzed by electrophoresis in a 1.5% agarose gel. 

Figure 2B. In vitro /raw^-splicing efficiency of various PTM constructs. 
Full length PTM with a spacer between the binding domain and splice site (PTM+Sp), 
PTM without the spacer region (PTM4-) and short PTMs that contain a target binding 
domain (short PTM+ ) or a non-target binding region (PTM-) were added to splicing 
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reactions containing pHCG target pre-mRNA, The products were RT-PCR amplified 
using primers pHCG-F and DT-3. For reactions containing the short PTMs, the reverse 
PGR primer was DT-4, since the binding site for DT-3 was removed from the PTM. 

Figure 3. Nucleotide sequence demonstrating the in vitro /raw-spliced 
product between a PTM and target pre-mRNA. The 466 bp /ran^-spliced RT-PCR 
product from Figure 2 (lane 2) was re-amplified using a 5' biotin labeled forward primer 
(pHCG-F) and a nested unlabeled reverse primer (DT-3R). Single stranded DNA was 
purified and sequenced directly using toxin specific DT-3R primer. The arrow indicates 
the splice jimction between the last nucleotide of target pHCG6 exon I and the first 
nucleotide encoding DT-A. 

Figure 4A. Schematic diagram of the "safety" PTM and variations, 
demonstrating the PTM intramolecular base-paired stem, intended to mask the BP and 
PPT from splicing factors. Underlined sequences represent the pHCG6 intron 1 
complementary target-binding domain, sequence in italics indicate target mismatches that 
are homologous to the BP. 

Figure 4B. Schematic of a safety PTM in open configuration upon 
binding to the target. 

Figure 4C. In vitro tranS'Splicing reactions were carried out by incubating 
either safety PTM or safety PTM variants with the pHCG6 target. Splicing reactions 
were amplified by RT-PCR using pHCG-F and DT-3R primers; products were analyzed 
in a 2.0% agarose gel. 
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Figure 5. Specificity of targeted trans-splicmg is enhanced by the 
inclusion of a safety into the PTM. pHCG6 pre-mRNA (250 ng) and P-globin pre- 
mRNA (250 ng) were annealed together with either PTM+SF (safety) or pcPTM+Sp 
(linear) RNA (500 ng). In vitro ^ra/u-splicmg reactions and RT-PCR analysis were 
performed as described under experimental procedxires and the products were separated 
on a 2.0% agarose gel. Primers used for RT-PCR are as indicated. 

Figure 6. In the presence of increasing PTM concentration, m-splicing is 
inhibited and replaced by /ran^-splicing. In vitro splicing reactions were performed in the 
presence of a constant amount of PHCG6 target pre-mRNA (1 00 ng) with increasing 
concentrations of PTM (pcPTM+Sp) RNA (52-300 ng). RT-PCR for cz^-spliced and un- 
spliced products utilized primers pHCG-F (exon 1 specific) and pHCG-R2 (exon 2 
specific - Panel A); primers pHCG-F and DT-3R were used to RT-PCR rra?iy-spliced 
products (Panel B). Reaction products were analyzed on 1.5% and 2.0% agarose gels, 
respectively. In panel A, lane 9 represents the 60 min time point in the presence of 
300 ng of PTM, which is equivalent to lane 10 in panel B. 

Figure 7A. PTMs are capable of rron^-splicing in cultured himian cancer 
cells. Total RNA was isolated firom each of 4 expanded neomycin resistant HI 299 lung 
carcinoma colonies transfected with pcSjH-CRM (expressing non-toxic mutant DT-A) 
RT-PCR was performed using 1 ng of total RNA and 5' biotinylated pHCG-F and non- 
biotinylated DT-3R primers. Single stranded DNA was purified and sequenced. 
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Figure 7B. Nucleotide sequence (sense strand) of the /ran5-spliced 
product between endogenous pHCG6 target and CRM197 mutant toxin is shown. Two 
arrows indicate the position of the splice junction. 

Figure 8A. Schematic diagram of a double splicing pre-therapeutic 

mRNA. 

Figure 8B. Selective /rany-splicing of a double splicing PTM. By varying 
the PTM concentration the PTM can be tran^-spliced mto either the 5' or the 3' splice site 
of the target. 

Figure 9. Schematic diagram of the use of PTM molecules for exon 
tagging. Two examples ofPTMs are shown. The PTM on the left is capable of non- 
specifically trans-splicing into a target pre-mRNA 3' splice site. The other PTM on the 
right is designed to non-specifically trans-splice into a target pre-mRNA 5' splice site. A 
PTM mediated trans-splicing reaction will result in the production of a chimeric RNA 
comprising a specific tag to either the 5' or 3' side of an authentic exon. 

Figure lOA. Schematic diagram of constructs foi use in the lacZ knock- 
out model. The target lacZ pre-mRNA contains the 5* fiiigment of lacZ followed by 
pHCG6 intron 1 and tiie 3' fragment of lacZ (target 1). The PTM molecule for use in the 
model system was created by digesting pPTM +SP with PstI and Hindlll and replacing 
the DT-A toxin with pHCG6 exon 2 (pc3.1PTM2). 

Figure lOB. Schematic diagram of restoration of P-Gal activity by 
Spliceosome Mediated RNA Trans-splicing. Schematic diagram of constructs for use in 
the lacZ knock-in model (pc.3.1 lacZ T2). The lacZ target pre-mRNA is identical to that 

NY02,3O21461 "12- 



3WPB-A-B 072874.0135 



target pre-mRNA used for the knock-out experiments except that it contains two stop 
codons (TAA TAA) in frame four codons after the 3' splice site. The PTM molecule for 
use in the model system was created by digesting pPTM +SP with PstI and Hindlll and 
replacing the DT-A toxin with functional 3' fragment of lacZ. 

Figure 1 1 A. Demonstration of c/5-and /rart?-splicing when utilizing the 
lacZ knock-out model. The LacZ splice target 1 pre-mRNA and PTM2 were 
co-transfected into 293T cells. Total RNA was then isolated and analyzed by PC:R for 
m-spliced and /r^ns'-spliced products using the appropriate specific primers. The 
amplified PGR products were separated on a 2% agarose gel. 

Figure 1 IB-C. Assays for p-galactosidase activity. 293 cells were 
transfected with lacZ target 2 DNA alone (panel B) or lacZ target 2 DNA and PTMl 
(panel C). 

Figure 12 A. Nucleotide sequence of /r^Tis-spliced molecule 
demonstrating accurate /ran^y-splicing. 

Figure 12B. Nucleotide sequences of the m-splic«d product and the 
trcms-splicGd product. The nucleotide sequences were those sequences expected for each 
of the different splicing reactions. 

Figure 13. Gene repair model for repair of the cystic fibrosis 
transmembrane regulator (CFTR) gene. 

Figure 14. RT-PCR demonstration of frow-splicing between an 
exogenously supplied CFTR mini-gene target and PTM. Plasmids were co-transfected 
into 293 embryonic kidney cells. The primers pairs used for RT-PCR reactions are listed 
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above each lane. The lower band (471 bp) in each lane represents a rran^-spliced 
product. The lower band in lane 1 (471 bp) was purified from a 2% Seakem agarose gel 
and the DNA sequence of the band was determmed. 

Figure 15. DNA sequence of the /rara-spliced product (lane 1, lower band 
5 shown in Figure 14), The DNA sequence indicates the presence of the F508 codon 

(CTT), exon 9 sequence is contiguous with exon 10 sequence, and the His tag sequence. 

Figure 1 6. Schematic representation of repair of an exogenously supplied 
CFTR target molecule carrying an F508 deletion in exon 10. 

Figure 17. Repair of endogenous CFTR transcripts by exon 10 
1 0 replacement using a double splicing PTM. The use of a double spUcmg PTM permits 
repair of the a508 mutation with a very short PTM molecule. 

Figure 1 8. Model lacZ target consisting of lacZ 5' exon - CFTR 
mini-mtron 9 - CFTR exon 10 (delta 508) - CFTR mini-intron 10 followed by the 
lacZ 3' exon. Binding domains for PTMs are bracketed. 
1 5 Figure 1 9. Schematic representation of double-Zrara-splicing PTMs 

designed to restore p-gal function. 

Figure 20. Schematic representation of a double-rra/iy-splicing reaction 
showing the binding of DSPTM7 vrith DSCFTl .6 target pre-mRNA. 

Figure 21. Important structural elements of DSPTM7. The double 
20 splicing PTM has both 3' and 5' functional splice sites as well as binding domains. 

Figure 22. Schematic diagram of mutant double splicing PTMs. 

Figure 23. Accuracy of double-Zr^tw-splicing reaction. 
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Figure 24, Double-/ra?zy-splicing between the target pre-mRNA and the 
DSPTM7 produces fiilHength protein. Western blot analysis of total cell ly sates using 
polyclonal anti-P-galactosidase antiserum. 

Figure 25. Precise internal exon substitution between the DSCFTl .6 
5 target pre-mRNA and DSPTM7 RNA by double-Zram-splicing produces fiinctionally 
active p-gal protein. Total cell extracts were prepared and assayed for p-gal activity 
using an ONPG assay. 

Figure 26. 3' and 5' splice sites are essential for the restoration of P-gal 
function by double-^ram'-splicing reaction. 
10 Figure 27. Double-rra/ty-splicing: titration of target and PTM. Different 

concentrations of the target and PTM were co-transfected and analyzed for p-gal activity 
restoration. 

Figure 28. Constructs designed to test the specificity of 
double-rra/iy-splicing reaction. 
1 5 Figure 29. Specificity of a double-Zran^-splicmg reaction. 

Figure 30. Tram-spMcmg repair of the cystic fibrosis gene using a PTM 
that mediates a double-/raw5-splicing event. 

Figure 3 1 . PTM with a long bindmg domain maskmg two splice sites and 
part of exon 1 0 in a mini-gene target. 
20 Figure 32. Sequence of a single PGR product showing target exon 9 

correctly spliced to PTM exon 10 (with modified codons) (upper panel), codon 508 in 
exon 10 of the PTM (middle panel) and PTM exon 10 correctly spliced to target exon 1 1 
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(lower panel). The sequence of a repaired target was generated by RT-PCR followed by 
PGR. 

Figure 33. rron^-splicing repair of the cystic fibrosis gene using a PTM 
that can perform 5' exon replacement. 

Figure 34. Schematic diagram of three different PTM molecules with 
different binding domains. 

Figure 35. Schematic diagram of PTM exon 10 with modified codon 
usage to reduce antisense effects with its own binding domain. 

Figure 36. Sequence of cis- and /rara-spliced products. 

Figure 37. Model system for repair of messenger RNAs by /rfl/u-splicing. 
(A) Schematic illustration of a defective lacZCF9m splice target used in the present 
study (see Materials and Methods for details). BP, branch point; PPT, polypyrimidine 
tracts; ss, splice sites and pA, polyadenylation signal. (B) A prototype PTM showing the 
key components of the /raw5-splicing domain, and the diagrams of various PTMs 
showing the binding domain length and approximate positions at which they bind to the 
target pre-mRNA. Unique restriction sites within the /ran^-splicing domain are N, Nhe I; 
S, Sac II; K, Kpn I and E, EcoR V. (C) Schematic diagram showing the binding of a 
PTM through antisense binding and repair of defective lacZ pre-mRNA through targeted 
RNA /ra«5-splicing. Expected cis and trans-spliced products and the primer binding sites 
for Lac-9F, Lac-3R and Lac-5R are indicated. 

Figure 38. Efficientrepair of /acZ messenger RNA. Target specific 
primers, Lac-9F (5' exon) and Lac-3R (3' exon) were used to amplify c/s-spliced products 
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(lanes 1-6), while; target and PTM specific primers, Lac-9F (5' exon) and Lac-5R 
(3' exon) were used to amplify /ra«5-spliced products (lanes 7-15). 25-50 ng of total 
RNA was used to measure target c/>splicing (lanes 1-6) and 50-200 ng of total RNA was 
used to measure PTM induced RNA /ranj-splicing (lanes 7-12). Lanes 13-15, 25-50 ng 
5 of total RNA from cells transfected with lacZCF9 a control for /raw5-splicing. 

(B) Endogenous mRNA repair by trans-splicing. Lanes 1-3, RNA from cells transfected 
with PTM-CF14; lanes 4-6, PTM-CF22 and lanes 7-9, PTM-CF24. Lane 10, RNA from 
mock-transfected cells and lane 1 1 is a control in which reverse-transcription reaction 
was omitted. 

1 0 Figure 39. Messenger RNA repair leads to synthesis of full-length 

p-galactosidase. Lane 1, lacZCF9 (positive control, 5 /zg); lane 2, lacZCF9m target alone 
(25 Aig); lane 3, PTM-CF24 alone (25 ^g) and lane 4, lacZCF9m target + PTM-CF24 

Figure 40. Messenger RNA repair by SMaRT produces functional 
15 p-galactosidase. (A) In situ detection of functional p-galactosidase produced by trans- 
splicing. 293T cells were either transfected (transient assay) with lacZCF9m target alone 
(panel A) or co-transfected with lacZCF9m target + PTM-CF24 (panel B) expression 
plasmids as described above. 48-hr post-transfection, cells were rinsed with PBS and 
stained /w 5/Ym for p-gal activity. (B) Repair of a defective lacZ mRNA produces 
20 functional p-galactosidase. Target and PTM, extracts from cells transfected with either 
lacZCF9m target or PTM-CF24 plasmid alone, and the rest were from cells co- 
transfected with lacZCF9m target and one of the PTMs as indicated. (C) Endogenous 
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mRNA repair by trans-splicing produces functional p-galactosidase. Stable cells 
expressing an endogenous lacZCF9m pre-mRNA target was transfected with "lineaf ' 
PTMs (PTM-CF14, PTM-CF22 or PTM-CF24) as described above. Following 
transfection, total cell lysate was prepared and assayed for P-gal activity. The results 

5 presented are the average of two independent transfections. 

Figure 41. Messenger RNA repair is specific. (A) Experimental strategy 
to measure non-specific trans-splicing between lacZHCGlm pre-mRNA and "linear" 
PTMs. (B) Extended binding domains enhance the specificity of trans-spHcing. 
Lanes 1-3, PTM-CF14; 4-6, PTM-CF22; 7-9, PTM-CF24; 10-12, PTM-CF26 and 13-15, 

1 0 PTM-CF27. (C) PTMs with very long binding domains are capable of increasing 

specificity. Total cell extract (5 ^\) was assayed in solution for p-gal activity and the 
specific activity was calculated, p-gal activity was normalized to mock and the results 
presented are the average of two mdependent transfections. Control, extract firom cells 
transfected with lacZHCGlm target alone and the rest were co-transfected with 

1 5 lacZHCGl m target and one of the linear PTMs. 

Figure 42. Complete sequence of CFTR PTM 30 (5* exon replacement 
PTM) showing the trans-splicing domain (underlined) and the coding sequence for exons 
1-10 of the CFTR gene. Modified codons in exon 10 are underlined and bold. 
Figure 43 A. 1 53 base-pair PTM 24 Bmding Domain. 

20 Figure 43B. Complete sequence of CFTR PTM 24 (3' exon replacement 

PTM) showing the /raw^-spUcing domain (underlined) and the coding sequence for exons 
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10-24 of the CFTR cDNA. At the end of the coding is a histidine tag and the translation 
stop codon. 

5. DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to compositions comprising prc-trans- 

5 splicing molecules (PTMs) and the use of such molecules for generating novel nucleic 
acid molecules. The PTMs of the invention comprise one or more target binding domains 
that are designed to specifically bind to pre-mRNA, a 3' splice region that includes a 
branch point, pyrimidine tract and a 3' splice acceptor site and/or a 5' splice donor site; 
and one or more spacer regions that separate the RNA splice site from the target binding 

1 0 domain. In addition, the PTMs of the invention can be engineered to contain any 
nucleotide sequences such as those encoding a translatable protein product. 

The methods of the invention encompass contacting the PTMs of the 
invention with a natural pre-mRNA under conditions in which a portion of the PTM is 
trans-splictd to a portion of tiie natural pre-mRNA to form a novel chimeric RNA. The 

1 5 target pre-mRNA is chosen as a target due to its expression within a specific cell type 
thus providing a mechanism for targeting expression of a novel RNA to a selected cell 
type. The resulting chimeric RNA may provide a desired function, or may produce a 
gene product in the specific cell type. The specific cells may include, but are not limited 
to those mfected with viral or other infectiotis agents, benign or malignant neoplasms, or 

20 components of the immune system which are involved in autoimmune disease or tissue 
rejection. Specificity is achieved by modification of the binding domain of the PTM to 
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bind to the target endogenous pre-mRNA. The gene products encoded by the chimeric 
RNA can be any gene, including genes having clinical useftdness, for example, 
therapeutic or marker genes, and genes encoding toxins. 

5.1. STRUCTURE OF THE PRE-r/P/fMS-SPLICINn MOLECULES 

5 The present invention provides compositions for use in generating novel 

chimeric nucleic acid molecules through targeted /rans-splicing. The PTMs of the 
invention comprise (i) one or more target binding domains that targets binding of the 
PTM to a prc-mRNA (ii) a 3' splice region that includes a branch point, pyrimidine tract 
and a 3' splice acceptor site and/or 5" splice donor site; and (iii) one or more spacer 

1 0 regions to separate the RNA splice site from the target binding domain. Additionally, the 
PTMs can be engineered to contain any nucleotide sequence encoding a translatable 
protein product. In yet another embodiment of the invention, the PTMs can be 
engineered to contain nucleotide sequences that inhibit the translation of the chimeric 
RNA molecule. For example, the nucleotide sequences may contain translational stop 

1 5 codons or nucleotide sequences that form secondary structures and thereby inhibit 
translation. Alternatively, the chimeric RNA may fimction as an antisense molecule 
thereby inhibiting translation of the RNA to wrhich it binds. 

The target binding domain of the PTM may contain multiple binding 
domains which are complementary to and in anti-sense orientation to the targeted region 

20 of the selected pre-mRNA. As used herein, a target binding domain is defined as any 
sequence that confers specificity of binding and anchors the pre-mRNA closely in space 
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so that the spliceosome processing machinery of the nucleus can tranS'Splicc a portion of 
the PTM to a portion of the pre-mRNA. The target binding domains may comprise up to 
several thousand nucleotides. In preferred embodiments of the invention the binding 
domains may comprise at least 10 to 30 and up to several hundred nucleotides. As 
5 demonstrated herein, the specificity of the PTM can be increased significantly by 
increasing the length of the target binding domain. For example, the target binding 
domain may comprise several hundred nucleotides or more. In addition, although the 
target binding domain may be "linear" it is understood that the RNA may fold to form 
secondary structures that may stabilize the complex thereby mcreasing the efficiency of 
10 spUcing. A second target binding region may be placed at the 3' end of the molecule and 
can be incorporated into the PTM of the invention. Absolute complementarity, although 
preferred, is not required. A sequence "complementary'* to a portion of an RNA, as 
referred to herein, means a sequence having sufficient complementarity to be able to 
hybridize with the RNA, forming a stable duplex. The ability to hybridize will depend on 
1 5 both the degree of complementarity and the length of the nucleic acid (See, for example, 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York). Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and still 
form a stable duplex. One skilled m the art can ascertain a tolemble degree of mismatch 
20 or length of duplex by use of standard procedures to determine the stability of the 
hybridized complex. 
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Where the PTMs are designed for use in intron-exon tagging or for 
peptide affinity tagging, a library of PTMs is genetically engineered to contain random 
nucleotide sequences in the target binding domain. Alternatively, for intron-exon tagging 
the PTMs may be genetically engineered so as to lack target binding domains. The goal 

5 of generating such a library of PTM molecules is that the library will contain a population 
of PTM molecules capable of binding to each RNA molecule expressed in the cell. A 
recombinant expression vector can be genetically engineered to contain a coding region 
for a PTM including a restriction endonuclease site that can be used for insertion of 
random DNA fragments into the PTM to form random target binding domains. The 

10 random nucleotide sequences to be included m the PTM as target binding domains can be 
generated using a variety of different methods well known to those of skill in the art, 
including but not limited to, partial digestion of DNA with restriction enzymes or 
mechanical shearing of DNA to generate random fragments of DNA, Random binding 
domain regions may also be generated by degenerate oligonucleotide synthesis. The 

1 5 degenerate oligonucleotides can be engineered to have restriction endonuclease 

recognition sites on each end to facilitate cloning into a PTM molecule for production of 
a library of PTM molecules having degenerate binding domains. 

Binding may also be achieved through other mechanisms, for example, 
through triple helix formation or protein/nucleic acid interactions such as those in which 

20 the PTM is engineered to recognize a specific RNA binding protein, /. e. , a protein bound 
to a specific target pre-mRNA. Alternatively, the PTMs of the mvention may be 
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designed to recognize secondary structures, such as for example, hairpin structures 
resulting from intramolecular base pairing between nucleotides within an RNA molecule. 
The PTM molecule also contains a 3' splice region that includes a branch 
point, pyrimidine tract and a 3' splice acceptor AG site and/or a 5* splice donor site. 

5 Consensus sequences for the 5' splice donor site and the 3* splice region used in RNA 
splicing are well known in the art (See, Moore, et al, 1993, The RNA World, Cold 
Spring Harbor Laboratory Press, p. 303-358). In addition, modified consensus sequences 
that maintain the ability to function as 5' donor splice sites and 3' splice regions may be 
used in the practice of the invention. Briefly, the 5' splice site consensus sequence is 

1 0 AG/GURAGU (where A=adenosine, U=uracil, G=guanine, C=cytosine, R=purine and 
/=tbe splice site). The 3' splice site consists of three separate sequence elements: the 
branch point or branch site, a polypyrimidine tract and the 3* consensus sequence (YAG). 
The branch point consensus sequence in mammals is YNYURAC (Y=pyrimidine). The 
underiined A is the site of branch formation. A polypyrimidine tract is located between 

15 the branch point and the splice site acceptor and is important for different branch point 
utilization and 3' splice site recognition. 

Further, PTMs comprising a 3' acceptor site (AG) may be genetically 
engineered. Such PTMs may further comprise a pyrimidine tract and/or branch point 
sequence. 

20 Recently, pre-messenger RNA introns beginning with the dinucleotide AU 

and ending with the dinucleotide AC have been identified and referred to as U12 introns. 
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U12 intion sequences as well as any sequences that function as splice acceptor/donor 
sequences may also be used in PTMs. 

A spacer region to separate the RNA splice site from the target binding 
domain is also included in the PTM. The spacer region can have features such as stop 
5 codons which would block any translation of an unspUced PTM and/or sequences that 
enhance rra«y-splicii^ to the target pre-mRNA. 

In a preferred embodiment of the invention, a "safety" is also incorporated 
into the spacer, binding domain, or elsewhere in the PTM to prevent non-specific trans- 
splicing. This is a region of the PTM that covers elements of the 3' and/or 5' splice site of 
10 the PTM by relatively weak complementarity, preventing non-specific /ra/ts-splicing. 
The PTM is designed in such a way that upon hybridization of the binding /targeting 
portion(s) of the PTM, the 3' and/or 5'spUce site is uncovered and becomes fully active. 

The "safety" consists of one or more complementary stretches of ci5- 
sequence (or could be a second, separate, strand of nucleic acid) which weakly binds to 
15 one or both sides of the PTM branch pomt, pyrimidine tract, 3' splice site and/or 5' splice 
site (splicing elements), or could bind to parts of the splicing elements themselves. This 
"safety" binding prevents the splicing elements fi-om being active (i.e. block U2 snRNP 
or other splicing factors firom attaching to the PTM splice site recognition elements). The 
binding of the "safety" may be disrupted by the binding of the target binding region of the 
20 PTM to the target pre-mRNA, thus exposing and activating the PTM splicing elements 
(making them available to trems-spMce into the target pre-mRNA). 
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A nucleotide sequence encoding a translatable protein capable of 
producing an effect, such as cell death, or alternatively, one that restores a missing 
function or acts as a marker, is included in the PTM of the invention. For example, the 
nucleotide sequence can include those sequences encoding gene products missing or 
5 altered in known genetic diseases. Altematively, the nucleotide sequences can encode 
marker proteins or peptides v^hich may be used to identify or image cells. In yet another 
embodiment of the invention nucleotide sequences encoding affmity tags such as, HIS 
tags (6 consecutive histidine residues) (Janknecht, et al., 1991, Proc. Natl. Acad. Sci. 
USA 88:8972-8976), the C-terminus of glutathione-S-transferase (GST) (Smith and 
10 Johnson, 1986, Proc. Natl. Acad. Sci. USA 83:8703-8707) (Pharmacia) or FLAG (Asp- 
Tyr-Lys-Asp-Asp-Asp-Lys) (Eastman Kodak/IBI, Rochester, NY) can be included in 
PTM molecules for use in affinity purification. The use of PTMs containing such 
nucleotide sequences results in the production of a chimeric RNA encoding a fusion 
protein containing peptide sequences normally expressed in a cell linked to the peptide 
1 5 affinity tag. The affinity tag provides a method for the rapid purification and 

identification of peptide sequences expressed in the cell. In a preferred embodiment the 
nucleotide sequences may encode toxins or other proteins v^hich provide some function 
which enhances the susceptibihty of the cells to subsequent treatments, such as radiation 
or chemotherapy. 

20 In a highly preferred embodiment of the invention a PTM molecule is 

designed to contain nucleotide sequences encoding the Diphtheria toxin subunit A 
(Greenfield, L., et al., 1983, Proc. Natl. Acad. Sci. USA 80: 6853-6857). Diphtheria 
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toxin subunit A contains enzymatic toxin activity and will function if expressed or 
delivered into human cells resulting in cell death. Furthermore, various other known 
peptide toxins may be used in the present invention, including but not limited to, ricin, 
Pseudomonus toxin, Shiga toxin and exotoxin A. 

5 Additional features can be added to the PTM molecule either after, or 

before, the nucleotide sequence encoding a translatable protein, suchas polyadenylation 
signals or 5* splice sequences to enhance splicing, additional binding regions, "safety 
self complementary regions, additional splice sites, or protective groups to modulate the 
stability of the molecule and prevent degradation. 

1 0 Additional features that may be incorporated mto the PTMs of the 

mvention include stop codons or other elements in the region between the binding 
domain and the splice site to prevent unspliced pre-mRNA expression. In another 
embodiment of the invention, PTMs can be generated with a second anti-sense binding 
domain downstream from the nucleotide sequences encoding a translatable protein to 

1 5 promote bmding to the 3' target intron or exon and to block the fixed authentic 
splice site (U5 and/or Ul binding sites). 

PTMs may also be generated that require a double-rrow^-splicing reaction 
for generation of a chimeric rraw^-spliced product. Such PTMs could be used to replace 
an internal exon which could be used for RNA repair. PTMs designed to promote two 

20 rraw^-splicing reactions are engineered as described above, however, they contain both 5' 
donor sites and 3' spUce acceptor sites. In addition, the PTMs may comprise two or more 
binding domains and splicer regions. The splicer regions may be place between the 
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multiple binding domains and splice sites or alternatively between the multiple binding 
domains. 

Further elements such as a 3' hairpin structure, circularized RNA, 
nucleotide base modification, or a synthetic analog can be incorporated into PTMs to 
5 promote or facilitate nuclear localization and spliceosomal incorporation, and intra- 
cellular stability. 

Additionally, when engineering PTMs for use in plant cells it may not be 
necessary to include conserved branch point sequences or polypyrimidine tracts as these 
sequences may not be essential for intron processing in plants. However, a 3* splice 

10 acceptor site and/or 5' splice donor site, such as those required for splicing in vertebrates 
and yeast, will be included. Further, the efficiency of splicmg in plants may be increased 
by also including UA-rich intronic sequences. The skilled artisan v^U recognize that any 
sequences that are capable of mediating a /raw^-splicing reaction in plants may be used. 

The PTMs of the invention can be used in methods designed to produce a 

1 5 novel chimeric RNA m a target celL The methods of the present invention comprise 
delivering to the target cell a PTM which may be in any form used by one skilled in the 
art, for example, an RNA molecule, or a DNA vector which is transcribed into a RNA 
molecule, wherein said PTM binds to a pre-mRNA and mediates a /raw-splicing reaction 
resulth^ in formation of a chimeric RNA comprising a portion of the PTM molecule 

20 spliced to a portion of the pre-mRNA. 

5.2. SYNTHESIS OF THE TRANS-SPUCmG MOLECULES 
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The nucleic acid molecules of the invention can be RNA or DNA or 
derivatives or modified versions thereof, single-stranded or double-stranded. By nucleic 
acid is meant a PTM molecule or a nucleic acid molecule encoding a PTM molecule, 
whether composed of deoxyribonucleotides or ribonucleosides, and whether composed of 

5 phosphodiester Imkages or modified linkages. The term nucleic acid also specifically 
includes nucleic acids composed of bases other than the five biologically occurring bases 
(adenine, guanine, thymine, cytosine and uracil). 

The RNA and DNA molecules of the invention can be prepared by any 
method known in the art for the synthesis of DNA and RNA molecules. For example, the 

10 nucleic acids may be chemically synthesized using commercially available reagents and 
synthesizers by methods that are well known in the art (see, e.e.. Gait, 1985, 
Oligonucleotide Synthesis: A Practical Approach, IRL Press, Oxford, England). 
Alternatively, RNA molecules can be generated by in vitro and in vivo transcription of 
DNA sequences encoding the RNA molecule. Such DNA sequences can be incoqx>rated 

15 into a vwde variety of vectors which incorporate suitable RNA polymerase promoters 
such as the T7 or SP6 polymerase promoters. RNAs may be produced in high yield via 
in vitro transcription using plasmids such as SPS65 (Promega Corporation, Madison, 
WI). In addition, RNA amplification methods such as Q-P amplification can be utilized 
to produce RNAs. 

20 The nucleic acid molecules can be modified at the base moiety, sugar 

moiety, or phosphate backbone, for example, to improve stability of the molecule, 
hybridization, transport into the cell, etc. For example, modification of a PTM to reduce 

hfY02 302146-1 "28" 




IB-A-B 072874.0135 



the overall charge can enhance the cellular uptake of the molecule. In addition 
modifications can be made to reduce susceptibility to nuclease degradation. The nucleic 
acid molecules may include other appended groups such as peptides (e.g., for targeting 
host cell receptors in v/ vo), or agents facilitating transport across the cell membrane (see, 
5 e.g., Letsinger et aL, 1989, Proc. Natl. Acad. Sci, U.S.A. 86:6553-6556; Lemaitre et al, 
1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810, published 
December 15, 1988) or the blood-brain barrier (see, e.g. , PCT Publication No. 
W089/10134, published April 25, 1988), hybridization-triggered cleavage agents. (See, 
e.g., Krol et al.^ 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 

10 1988, Pharm. Res. 5:539-549). To this end, the nucleic acid molecules may be 

conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking 
agent, transport agent, hybridization-triggered cleavage agent, etc. Various other well- 
known modifications to the nucleic acid molecules can be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include, but are not 

15 limited to, the addition of flanking sequences of ribo- or deoxy- nucleotides to the 5' 
and/or 3' ends of the molecule. In some circumstances where increased stability is 
desired, nucleic acids having modified intemucleoside linkages such as 2'-0-methylation 
may be preferred. Nucleic acids containing modified intemucleoside linkages may be 
synthesized using reagents and methods that are well known in the art (see, Uhlmann et 

20 a/., 1 990, Chem. Rev. 90:543-584; Schneider et al, 1990, Tetrahedron Lett. 31 :335 and 
references sited therein). 
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The nucleic acids may be purified by any suitable means, as are well 
known in the art. For example, the nucleic acids can be purified by reverse phase 
chromatography or gel electrophoresis. Of course, the skilled artisan will recognize that 
the method of purification will depend in part on the size of the nucleic acid to be 
purified. 

In instances where a nucleic acid molecule encoding a PTM is utilized, 
cloning techniques known in the art may be used for cloning of the nucleic acid molecule 
into an expression vector. Methods conmionly known in the art of recombinant DNA 
technology which can be used are described in Ausubel et aL (eds.), 1993, Current 
Protocols in Molecular Biology, John Wiley & Sons, NY; and Kriegler, 1990, Gene 
Transfer and Expression, A Laboratory Manual, Stockton Press, NY. 

The DNA encoding the PTM of interest may be recombinantly engineered 
into a variety of host vector systems that also provide for replication of the DNA in large 
scale and contain the necessary elements for directing the transcription of the PTM. The 
use of such a construct to transfect target cells in the patient will result in the transcription 
of sufficient amounts of PTMs that v^dll form complementary base pairs with the 
endogenously expressed pre-mRNA targets and thereby facilitate a rrara-splicing 
reaction between the complexed nucleic acid molecules. For example, a vector can be 
introduced in vivo such that it is taken up by a cell and directs the transcription of the 
PTM molecule. Such a vector can remain episomal or become chromosomally 
integrated, as long as it can be transcribed to produce the desired RNA. Such vectors can 
be constructed by recombinant DNA technology methods standard in the art 
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Vectors encoding the PTM of interest can be plasmid, viral, or others 
known in the art, used for replication and expression in mammalian cells. Expression of 
the sequence encoding the PTM can be regulated by any promoter known in the art to act 
in mammalian, preferably human cells. Such promoters can be inducible or constitutive. 

5 Such promoters include but are not limited to: the SV40 early promoter region (Benoist, 
C. and Chambon, P. 1981, Nature 290:304-310), the promoter contained in the 3* long 
terminal repeat of Rous sarcoma virus (Yamamoto et aly 1980, Cell 22:787-797), the 
herpes thymidine kinase promoter (Wagner et al, 1981, Proc. Natl. Acad. Sci. U.S.A. 
78:1441 1445), the regulatory sequences of the metallothionein gene (Brinster et al. 1982, 

10 Nature 296:39-42), the viral CMV promoter, the human chorionic gonadotropin-p 

promoter (HoUenberg et al., 1994, Mol. Cell. Endocrinology 106:1 11-119), etc. Any type 
of plasmid, cosmid, YAC or viral vector can be used to prepare the recombinant DNA 
construct which can be introduced directly into the tissue site. Alternatively, viral vectors 
can be used which selectively infect the desired target cell, 

1 5 For use of PTMs encoding peptide affinity purification tags, it is desirable 

to insert nucleotide sequences containing random target binding sites into the PTMs and 
clone them into a selectable mammalian expression vector system. A number of selection 
systems can be used, including but not limited to selection for expression of the herpes 
simplex virus thymidine kinase, hypoxanthine-guanine phosphoribosyltransterase and 

20 adenine phosphoribosyl tranferase protein in tic-, hgprt- or aprt- deficient cells, 

respectively. Also, anti-metabolic resistance can be used as the basis of selection for 
dihydrofolate tranferase {dhfr\ which confers resistance to methotrexate; xantiiine- 
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guanine phosphoribosyl transferase (gpt), which confers resistance to mycophenolic acid; 
neomycm {neo\ which confers resistance to aminoglycoside G-41 8; and hygromycin B 
phosphotransferase Qiy^o) which confers resistance to hygromycin. hx a preferred 
embodiment of the invention, the cell culture is transformed at a low ratio of vector to 
5 cell such that there will be only a single vector, or a limited number of vectors, present in 
any one cell. Vectors for use in the practice of the invention include any eukaryotic 
expression vectors, including but not limited to viral expression vectors such as those 
derived Ifrom the class of retroviruses or adeno-associated viruses. 

5.3. USES AND ADMINISTRATION OF rjL4A^5-SPLICING MOLECU LES 

10 5.3.1. USE OF PTM MOLECULES FOR GENE REGULA'nON, GENE 

REPAIR AND TARGETED CELL DEATH 

The compositions and methods of the present invention v^^ill have a variety 
of different applications including gene regulation, gene repair and targeted cell death. 
For example, rra«5-splicing can be used to introduce a protein with toxic properties into 
1 5 a cell. In addition, PTMs can be engineered to bind to viral mRNA and destroy the 
function of the viral mRNA, or alternatively, to destroy any cell expressing the viral 
mRNA. In yet another embodiment of the invention, PTMs can be engineered to place a 
stop codon in a deleterious mRNA transcript thereby decreasing the expression of that 
transcript 

20 Targeted rrara-splicing, including double-Zrans-splicing reactions, can be 

used to repair or correct transcripts that are either truncated or contain point mutations. 
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The PTMs of the invention are designed to cleave a targeted transcript upstream or 
downstream of a specific mutation or upstream of a premature 3' and correct the mutant 
transcript via a /ranj-splicing reaction which replaces the portion of the transcript 
containing the mutation with a functional sequence. 

5 In addition, double fran5-splicing reactions may be used for the selective 

expression of a toxin in tumor cells. For example, PTMs can be designed to replace the 
second exon of the human p-chronic gonadotropin-6 (phCG6) gene transcripts and to 
deliver an exon encoding the subunit A of diptheria toxin (DT-A). Expression of DT-A 
in the absence of subunit B should lead to toxicity only in the cells expressing the gene. 

1 0 phCG6 is a prototypical target for genetic modification by trans-splicing. The sequence 
and the structure of the phCG6 gene are completely known and the pattern of splicing has 
been determined. The phCG6 gene is highly expressed m many types of solid tumors, 
including many non-germ line tumors, but the phCG6 gene is silent in the majority cells 
in a normal adult Therefore, the phCG6 pre-mRNA represents a desirable target for a 

15 trans-splicing reaction designed to produce tumor-specific toxicity. 

The first exon of phCG6 pre-mRNA is ideal in that it encodes only five 
amino acids, includmg the initiator AUG, which should result in minimal mterference 
with the proper folding of the DT-A toxin while providing the required signals for 
effective translation of the trans-spliced mRNA. The DT-A exon, which is designed to 

20 include a stop codon to prevent chimeric protein formation, will be engineered to trans- 
splice into the last exon of the phCG6 gene. The last exon of the phCG6 gene provides 
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the construct with the appropriate signals to poly adenylate the mRNA and ensure 
translation. 

Cystic fibrosis (CF) is one of the most common fatal genetic disease in 
humans. Based on both genetic and molecular analyses, the gene associated with cystic 

5 fibrosis has been isolated and its protein product deduced (Kerem, B,S. et al., 1989, 
Science 245:1073-1080; Riordan et al., 1989, Science 245: 1066-1 073 ;Rommans, et al., 
1989, Science 245:1059-1065). The protein product of the CF associated gene is called 
the cystic fibrosis transmembrane conductance regulator (CFTR). In a specific 
embodiment of the invention, a /ram-splicing reaction will be used to correct a genetic 

10 defect in the DNA sequence encoding the cystic fibrosis transmembrane regulator 
(CFTR) whereby the DNA sequence encoding the cystic fibrosis trans-membrane 
regulator protein is expressed and a fiinctional chloride ion channel is produced in the 
airway epithelial cells of a patient. 

Population studies have indicated that the most conmion cystic fibrosis 

1 5 mutation is a deletion of the three nucleotides in exon 1 0 that encode phenylalanine at 
position 508 of the CFTR amino acid sequence. As indicated in Figure 15, a trans- 
splicing reaction was capable of correcting the deletion at position 508 in the CFTR 
amino acid sequence. The PTM used for correction of the genetic defect contained a 
CFTR BD intron 9 sequence, a spacer sequence, a branch point, a polypyrimidine tract, a 

20 3* splice site and a wild type CFTR BD exon 10 sequence (Figure 13), The successfiil 
correction of the mutated DNA encoding CFTR utiliang a rraw5-splicing reaction 
supports the general application of PTMs for correction of genetic defects. 
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The methods and compositions of the invention may also be used to 
regulate gene expression in plants. For example, trans-splicing may be used to place the 
expression of any engineered gene under the natural regulation of a chosen target plant 
gene, thereby regulating the expression of the engineered gene. rra/w-splicLng may also 
be used to prevent the expression of engineered genes in non-host plants or to convert an 
endogenous gene product into a more desirable product. 

In a specific embodiment of the invention /ra«-5plicing may be used to 
regulate the expression of the insecticidal gene that produces Bt toxin (Bacillus 
thuringiensis). For example, the PTM may be designed to ^raw-splice into an injury 
response gene (pre-mRNA) that is expressed only after an insect bites the plant. Thus, all 
cells of the plant would carry the gene for Bt in the PTM, but the cells would only 
produce Bt when and where an insect injures the plant. The rest of the plant will make 
little or no Bt. A PTM could trans-splicc the Bt gene into any chosen gene with a desired 
pattem of expression. Fiirther, it should be possible to target a PTM so that no Bt is 
produced in the edible portion of the plant. 

One advantage associated with the use of PTMs is that the PTM acquires 
the native gene control elements of the target gene, thus, reducing the time and effort that 
might otherwise be spent attempting to identify and reconstitute appropriate regulatory 
sequences upstream of an engineered gene. Thus, expression of the PTM regulated gene 
should occur only in those plant cells containing the target pre-mRNA. By targeting a 
gene not expressed in the edible portion of the plant or in the pollen, trans-splicing can 
alleviate opposition to genetically modified plants, as consumers would not be eating the 
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proteins made from modified genes. The edible portion of such crops should test 
negative for genetically modified proteins. 

In addition, PTM can be targeted to a unique sequence of the host gene 
that is not present in other plants. Therefore, even if the gene (DNA) which encodes the 

5 PTM jumps to another species of plant, the PTM gene will not have an appropriate target 
for rrara-splicing. Thus, rran5r-splicing offers a "fail-safe'* mode for prevention of gene 
"jumping" to other plant species: the PTM gene v^U be expressed only in the engineered 
host plant, which contains the appropriate target pre-mRNA. Expression in non- 
engineered plants would not be possible. 

10 Trflw^-splicing also provides a more efficient way to convert one gene 

product into another. For example, /r^m-splicing ribozymes and chimeric oligos can be 
incorporated into com genomes to modify the ratio of saturated to unsaturated oils. 
rran5-splicing can also be used to convert one gene product into another. 

Various delivery systems are known and can be used to transfer the 

1 5 compositions of the invention mto cells, e.g, encapsulation in liposomes, microparticles, 
microcapsules, recombinant cells capable of expressing the composition, receptor- 
mediated endocytosis (see, e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), 
construction of a nucleic acid as part of a retroviral or other vector, injection of DNA, 
electroporation, calcium phosphate mediated transfection, etc. 

20 The compositions and methods can be used to treat cancer and other 

serious viral infections, autoimmune disorders, and other pathological conditions in 
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which the alteration or elimination of a specific cell type would be beneficial. 
Additionally, the compositions and methods may also be used to provide a gene encoding 
a fimctional biologically active molecule to cells of an individual with an inherited 
genetic disorder where expression of the missing or mutant gene product produces a 
normal phenotype. 

In a preferred embodiment, nucleic acids comprising a sequence encoding 
a PTM are administered to promote PTM function, by way of gene delivery and 
expression into a host cell. In this embodiment of the invention, the nucleic acid 
mediates an effect by promoting PTM production. Any of the methods for gene delivery 
into a host cell available in the art can be used according to the present mvention. For 
general reviews of the methods of gene delivery see Strauss, M. and Barranger, J. A., 
1997, Concepts in Gene Therapy, by Walter de Gruyter & Co., Berlin; Goldspiel et aL, 
1993, Clinical Phannacy 12:488-505; Wu and Wu, 1991, Biotherapy 3:87-95; 
Tolstoshev, 1993, Ann. Rev. Pharmacol. Toxicol. 33:573-596; Mulligan, 1993, Science 
260:926-932; and Morgan and Anderson, 1993, Ann. Rev. Biochem. 62:191-217; 1993, 
TIBTECH 1 1(5):155-215. Exemplary methods are described below. 

Delivery of the nucleic acid into a host cell may be either direct, in which 
case the host is directiy exposed to the nucleic acid or nucleic acid-carrying vector, or 
indirect, in which case, host cells are first transformed with the nucleic acid in vitro^ then 
transplanted into the host. These two approaches are known, respectively, as in vivo or ex 
vivo gene delivery. 
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In a specific embodiment, the nucleic acid is directly administered in vivo, 
where it is expressed to produce the PTM. This can be accomplished by any of numerous 
methods known in the art, e.g., by constructing it as part of an appropriate nucleic acid 
expression vector and administering it so that it becomes mtracellular, e.g. by infection 

5 using a defective or attenuated retroviral or other viral vector (see U.S. Patent No. 

4,980,286), or by direct injection of naked DNA, or by use of microparticle bombardment 
(e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors or 
transfecting agents, encapsulation in liposomes, microparticles, or microcapsules, or by 
administering it in linkage to a peptide which is known to enter the nucleus, by 

10 administering it in linkage to a Ugand subject to receptor-mediated endocytosis (see e.g, , 
Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432). 

In a specific embodiment, a viral vector that contains the PTM can be 
used. For example, a retroviral vector can be utilized that has been modified to delete 
retroviral sequences that are not necessary for packaging of the viral genome and 

15 integration into host cell DNA (see Miller et al, 1993, Meth. Enzymol. 217:581-599). 
Alternatively, adenoviral or adeno-associated viral vectors can be used for gene delivery 
to cells or tissues. (See, Kozarsky and Wilson, 1993, Current Opinion in Genetics and 
Development 3:499-503 for a review of adenovirus-based gene delivery). 

Another approach to gene delivery into a cell involves transferring a gene 

20 to cells in tissue culture by such methods as electroporation, lipofection, calcium 
phosphate mediated transfection, or viral infection. Usually, the method of transfer 
includes the transfer of a selectable marker to the cells. The cells are then placed under 
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selection to isolate those cells that have taken up and are expressing the transferred gene. 
The resulting recombinant cells can be delivered to a host by various methods known in 
the art. In a preferred embodiment, the cell used for gene delivery is autologous to the 
host cell. 

5 The present invention also provides for pharmaceutical compositions 

comprising an effective amount of a PTM or a nucleic acid encoding a PTM, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term 
"pharmaceutically acceptable" means approved by a regulatory agency of the Federal or a 
state government or listed in the U.S. Pharmacopeia or other generally recognized 

10 pharmacopeia for use in animals, and more particularly in humans. The term "carrier" 
refers to a diluent, adjuvant, excipient, or vehicle v^rith which the therapeutic is 
administered. Examples of suitable pharmaceutical carriers are described in 
"Remington's Pharmaceutical sciences" by E.W. Martin. 

In specific embodiments, pharmaceutical compositions are administered: 

15 (1 ) in diseases or disorders involving an absence or decreased (relative to normal or 
desired) level of an endogenous protein or function, for example, in hosts where the 
protein is lacking, genetically defective, biologically inactive or underactive, or under 
expressed; or (2) in diseases or disorders whereia in vitro or in vivo, assays indicate the 
utility of PTMs tiaat inhibit the function of a particular protein . The activity of the 

20 protein encoded for by the chimeric mRNA resulting from the PTM mediated trans- 
splicing reaction can be readily detected, e.g., by obtaining a host tissue sample (e.g., 
from biopsy tissue) and assaying it in vitro for mRNA or protein levels, structure and/or 
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activity of the expressed chimeric mRNA. Many methods standard in the art can be thus 
employed, including but not limited to unmunoassays to detect and/or visualize the 
protein encoded for by the chimeric mRNA (e.g. , Western blot, immunoprecipitation 
followed by sodium dodecyl sulfate polyacrylamide gel electrophoresis, 
5 immunocytochemistry , etc.) and/or hybridization assays to detect formation of chimeric 
mRNA expression by detecting and/or visualizing the presence of chimeric mRNA {e.g.. 
Northern assays, dot blots, in situ hybridization, and Reverse-Transcription PGR, etc.), 
etc. 

The present invention also provides for pharmaceutical compositions 
1 0 comprising an effective amount of a PTM or a nucleic acid encoding a PTM, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term 
"pharmaceutically acceptable" means approved by a regulatory agency of the Federal or a 
state government or listed in the U,S. Pharmacopeia or other generally recognized 
pharmacopeia for use in animals, and more particularly in humans. The term "carrier" 
1 5 refers to a diluent, adjuvant, excipient, or vehicle with which the therapeutic is 
administered. Examples of suitable pharmaceutical carriers are described in 
"Remington's Pharmaceutical sciences" by E.W. Martin. In a specific embodiment, it may 
be desirable to administer the pharmaceutical compositions of the invention locally to the 
area in need of treatment. This may be achieved by, for example, and not by way of 
20 limitation, local infiision during surgery, topical application, e.g. , in conjunction with a 
wound dressing after surgery, by injection, by means ofa catheter, by means of a 
suppository, or by means of an implant, said implant being ofa porous, non-porous, or 
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gelatinous material, including membranes, such as sialastic membranes, or fibers. Other 
control release drug delivery systems, such as nanoparticles, matrices such as controlled- 
release polymers, hydrogels. 

The PTM will be administered in amounts which are effective to produce 
the desired effect in the targeted cell. Effective dosages of the PTMs can be determined 
through procedures well known to those in the art which address such parameters as 
biological half-life, bioavailability and toxicity. The amount of the composition of the 
invention which will be effective will depend on the nature of the disease or disorder 
being treated, and can be determined by standard clinical techniques. In addition, in vitro 
assays may optionally be employed to help identify optimal dosage ranges. 

The present invention also provides a pharmaceutical pack or kit 
comprising one or more containers filled with one or more of the ingredients of the 
pharmaceutical compositions of the invention optionally associated with such 
container(s) can be a notice in the form prescribed by a govermnental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological products, which notice 
reflects approval by the agency of manufacture, use or sale for human administration. 

5.3.2. USE OF PTM MOLECULES FOR EXON TAGGING 
In view of current efforts to sequence and characterize the genomes of 
humans and other organisms, there is a need for methods that facilitate such 
characterization. A majority of the information currently obtained by genomic mapping 
and sequencing is derived firom complementary DNA (cDNA) libraries, which are made 
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by reverse transcription of mRNA into cDNA. Unfortunately, this process causes the loss 
of information concerning intron sequences and the location of exon/intron boundaries. 

The present invention encompasses a method for mapping exon-intron 
boundaries in pre-mRNA molecules comprising (i) contacting a pre-trans-splicing 
5 molecule with a pre-mRNA molecule under conditions in which a portion of the pre- 
trans-splicing molecule is trans-spliced to a portion of the target pre-mRNA to form a 
chimeric mRNA; (ii) amplifying the chimeric mRNA molecule; (iii) selectively 
purifying the amplified molecule; and (iv) determining the nucleotide sequence of the 
amplified molecule thereby identifying the intron-exon boundaries. 
10 In an embodiment of the present invention, PTMs can be used in trans- 

splicing reactions to locate exon-intron boundaries in pre-mKNAs molecules. PTMs for 
use in mappii^ of intron-exon boundaries have structures similar to those described 
above in Section 5.1. Specifically, the PTMs contain (i) a target binding domain that is 
designed to bind to many pre-mRNAs: (ii) a 3' splice region that includes a branch point, 
1 5 pyrimidine tract and a 3' splice acceptor site, or a 5' splice donor site; (iii) a spacer region 
that separates the mRNA splice site from the target binding domain; and (iv) a tag region 
that will be trans-spliced onto a pie-mRNA. Alternatively, the PTMs to be used to locate 
exon-intron boundaries may be engineered to contain no target binding domain. 

For purposes of intron-exon mapping, the PTMs are genetically 
20 engineered to contain target binding domains comprising random nucleotide sequences. 
The random nucleotide sequences contain at least 15-30 and up to several hundred 
nucleotide sequences capable of binding and anchoring a pre-mRNA so that the 
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spliceosome processing machinery of the nucleus can trans-splice a portion (tag or 
marker region) of the PTM to a portion of the pre-mRNA. PTMs containing short target 
binding domams, or containing inosines bind under less stringent conditions to the pre- 
mRNA molecules. In addition, strong branch point sequences and pyrimidine tracts serve 

5 to increase the non-specificity of PTM trans-splicmg. 

The random nucleotide sequences used as target binding domains in the 
PTM molecules can be generated using a variety of different methods, including, but not 
limited to, partial digestion of DNA with restriction endonucleases or mechanical 
shearing of the DNA. The use of such random nucleotide sequences is designed to 

10 generate a vast array of PTM molecules with different binding activities for each target 
pre-mRNA expressed m a cell. Randomized libraries of oligonucleotides can be 
synthesized with appropriate restriction endonucleases recognition sites on each end for 
cloning into PTM molecules genetically engineered into plasmid vectors. When the 
randomized oligonucleotides are litigated and expressed, a randomized binding library of 

15 PTMs is generated. 

In a specific embodiment of the invention, an expression library encoding 
PTM molecules containing target binding domains comprising random nucleotide 
sequences can be generated using a variety of methods which are well known to those of 
skill in the art. Ideally, the library is complex enough to contain PTM molecules capable 
20 of interacting with each target pre-mRNA expressed in a cell 

By way of example. Figure 9 is a schematic representation of two forms of 
PTMs which can be utilized to map intron-exon boundaries. The PTM on the left is 
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capable of non-specifically trans-splicing into a pre-mRNA 3' splice site, while the PTM 
on the right is capable of trans-splicing into a pre-mRNA 5' splice site. Trans-splicing 
between the PTM and the target pre-mRNA results in the production of a chimeric 
mRNA molecule having a specific nucleotide sequence "tag" on either the 3' or 5' end of 

5 an authentic exon. 

Following selective purification, a DNA sequencing reaction is then 
performed using a primer which begins in the tag nucleotide sequence of the PTM and 
proceeds into the sequence of the tagged exon. The sequence immediately following the 
last nucleotide of the tag nucleotide sequence represents an exon boundary. For 

10 identification of intron-exon tags, the trans-splicing reactions of the invention can be 

performed either in vitro or in vivo using methods well known to those of skill in the art 

5.3.3. USE OF PTM MOLECULES FOR IDENTIFICATION OF 
PROTEINS EXPRESSED IN A CELL 

In yet another embodiment of the invention, PTM mediated trans-splicing 

1 5 reactions can be used to identify previously undetected and unknown proteins expressed 

in a cell. This metiiod is especially usefiU for identification of proteins tiiat cannot be 

detected by a two-dimensional electrophoresis, or by otiier methods, due to inter alia the 

small size of tiie protein, low concentration of tiie protein, or failure to detect the protein 

due to similar migration patterns with other protems in two-dimensional electrophoresis. 

20 The present invention relates to a method for identifying proteins 

expressed in a cell comprising (i) contacting a pre-tians-splicing molecule containing a 



NY02.302146.1 




i-A-B 072874.0135 



random target binding domain and a nucleotide sequence encoding a peptide tag with a 
pre-mRNA molecule under conditions in which a portion of the pre-trans-splicing 
molecule is trans-spliced to a portion of the target pre-mRNA to form a chimeric mRNA 
encoding a fusion polypeptide or separating it by gel electrophoresis (ii) affinity purifying 
5 the fusion polypeptide; and (iii) determining the amino acid sequence of the fusion 
protein. 

To identify proteins expressed in a cell, the PTMs of the invention are 
genetically engineered to contain: (i) a target binding domain comprising randomized 
nucleotide sequences; (ii) a 3* splice region that includes a branch point, pyrimidine tract 

10 and a 3* splice acceptor site and/or a 5' splice donor site; (iii) a spacer region that 
separates the PTM splice site from the target binding domain; and (iv) nucleotide 
sequences encodmg a marker or peptide affmity purification tag. Such peptide tags 
include, but are not limited to, HIS tags (6 histidine consecutive residues) (Janknecht, 
et al., 1991 Proc. Natl. Acad. Sci. USA 88:8972-8976), glutathione-S-transferase (GST) 

1 5 (Smith, D.B. and Johnson K.S., 1 988, Gene 67:3 1) (Pharmacia) or FLAG (Kodak/IBI) 
tags (Nisson, J. et al. J. Mol. Recognit, 1996, 5:585-594) 

rraw5-splicing reactions using such PTMs results in the generation of 
chimeric mRNA molecules encoding fusion proteins comprising protein sequences 
normally expressed in a cell linked to a marker or peptide affinity purification tag. The 

20 desired goal of such a method is that every protein synthesized in a cell receives a marker 
or peptide affinity tag thereby providing a method for identifying each protein expressed 
in a cell. 
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In a specific embodiment of the invention, PTM expression libraries 
encoding PTMs having different target binding domains comprising random nucleotide 
sequences are generated. The desired goal is to create a PTM expression library that is 
complex enough to produce a PTM capable of binding to each pre-mRNA expressed in a 
5 cell. In a preferred embodiment, the library is cloned into a mammalian expression vector 
that results in one, or at most, a few vectors being present in any one cell. 

To identify the expression of chimeric proteins, host cells are transformed 
with the PTM library and plated so that individual colonies containing one PTM vector 
can be grown and purified. Single colonies are selected, isolated, and propagated in the 
1 0 appropriate media and the labeled chimeric protein exon(s) fragments are separated away 
from other cellular proteins using, for example, an affinity purification tag. For example, 
affinity chromatography can involve the use of antibodies that specifically bmd to a 
peptide tag such as the FLAG tag. Altematively, when utilizing HIS tags, the fusion 
proteins are purified using a Ni^^ nitriloacetic acid agarose columns, which allows 
1 5 selective elution of bound peptide eluted with imidazole containing buffers. When using 
GST tags, the fiision proteins are purified using glutathione-S-transferase agarose beads. 
The fusion proteins can then be eluted in the presence of free glutathione. 

FoUowdng purification of the chimeric protein, an analysis is carried out 
to determine the ammo acid sequence of the fusion protein. The amino acid sequence of 
20 the fiision protein is determined using techniques well known to those of skill m the art, 
such as Edman Degradation followed by amino acid analysis using HPLC, mass 
spectrometry or an amino acid analyzation. Once identified, the peptide sequence is 
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compared to those sequences available in protein databases, such as GenBank. If the 
partial peptide sequence is already known, no further analysis is done. If the partial 
protein sequence is unknown, then a more complete sequence of that protein can be 
carried out to determine the full protein sequence. Since the fusion protein will contain 
5 only a portion of the full length protein, a nucleic acid encodmg the full length protein 
can be isolated using conventional methods. For example, based on the partial protein 
sequence oligonucleotide primers can be generated for use as probes or PGR primers to 
screen a cDNA library. 

6. EXAMPLE: PRODUCTION O F TP4NS-^P\JCTNG MOLEGULES 
10 The following section describes the production of PTMs and the 

demonstration that such molecules are capable of mediating rrans-splicing reactions 
resulting in the production of chimeric mRNA molecules. 



6.1. MATERIALS AND METHODS 
6.1.1. CONSTRUGTION OF PRE-mRNA MOLEGULES 
1 5 Plasmids containing the wild type diphtheria toxin subunit A (DT-A, wild- 

type accession #K01722) and a DT-A mutant (GRM 197, no enzymatic activity) were 
obtained from Dr. Virginia Johnson, Food and Drug Administration, Bethesda, Maryland 
(Uchida et aL, 1973 J, Biol. Ghem 248:3838). For in vitro experiments, DT-A was 
amplified using primers: DT-IF (5'-GGGGGTGGAGGGGGGTGATGATGTTGTTG); 
20 and DT-2R (5'-GGCGAAG CITGGATCCGACACGATTTCCTGCACAGG), cut with 
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PstI and ffindlll, and cloned into PsU and Hindlll digested pBS(-) vector (Stratagene, 
La JoUa, CA). The resulting clone, pDTA was used to construct the individual PTMs. 
(1) pPTM+: Targeted construct. Created by inserting IN3-1 (5'AATTCTCTAGATGCTT 
CACCCGGGCCTGACTCGAGTACTAACTGOTACCTCTTCTTTTTTTTCCTGCA) 
5 and IN2-4 (S'-GGAAAAAAAAGAAGAGGTACCAGTTAGTACTCGAGTCAGG 
CCCGGGTGAAGCATCTAGAG) primers into EcoRI and Pstl digested pDTA. (2) 
pPTM+Sp: As pPTM+ but with a 30 bp spacer sequence between the BD and BP. 
Created by digesting pPTM+ with Xhol and ligating in the oligonucleotides, spacer S (5- 
TCGAGCAACGTTATAATAATGTTC) and spacer AS (5'-TCGAGAACATTATT 
1 0 ATAACGTTGC). For in vivo studies, an EcoRI and Hindi!! fragment of pcPTM+Sp was 
cloned into mammalian expression vector pcDNA3.1 (Invitrogen), under the control of a 
CMV promoter. Also, the methionine at codon 14 was changed into isoleucine to prevent 
initiation of translation. The resulting plasmid was designated as pcPTM+Sp. (3) 
pPTM+CRM: As pPTM+Sp but the wild type DT-A was substituted with CRM mutant 
1 5 DT-A (T. Uchida, et al., 1973, J. Biol. Chem. 248:3838). This was created by PCR 
amplification of a DT-A mutant (mutation at G52E) using primers DT-IF and DT-2R. 
For in vivo studies, an EcoM Hind!!! fragment of PTM+CRM was cloned into pc3.1DNA 
that resulted in pcPTM+ARM. (4) PTM-: Non-targeted consfruct. Created by digestion 
of PTM+ with Ecoia and Pst I, gel purified to remove the binding domain followed by 
20 ligation of the oligonucleotides, IN-5 (5'-ATCTCTAGATCAGGCCCGGGTGAAGCC 
CGAG) and IN-6 (5'-TGCTTCACCC GGGCCTGATCTAGAG). (5) PTM-Sp, is an 
identical version of the PTM-, except it has a 30 bp spacer sequence at the Pstl site. 
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Similarly, the splice mutants [Py(-)AG(-) and BP(-)Py(-)AG(-)] and safety variants 
[PTM+SF-Pyl, PTM+SF-Py2, PTM+SFBP3 and PTM+SFBP3-Pyll were constructed 
either by insertion or deletion of specific sequences (see Table 1). 



Table 1. Binding/non-binding domain, BP, PPT and 3' as sequences of different PTN 


fs 


PTM construct 


BD/NBD 


BP 


PPT 


3 SS 


PTM+Sp (targeted) 


:TGCTTCACCCGGGCCTGA 


TACTAAC 


CTCrTCTTTTTTTTCC 


TAG 


PTM-Sp (non-targeted) 


:CAACGTTATAATAATGTT 


TACTAAC 


CTCTTCTTTTITTTCC 


CAG 


PTM+Py (-)AG(-)BP(-) 


iTGCTTCACCCGGGCCTGA 


GGCTGAT 


CTGTGATTAATAGCGG 


ACG 


PTM+Py(-)AG(-) 


:TGCTTCACCCGGGCCTGA 


TACTAAC 


CCTGGACGCGGAAGTT 


ACG 


PTM+SF 


iCTGGGACAAGGACACTGCTT 
CACCCGGTTAGTAGACCACA 
GCCCTGAAGCC 


TACTAAC 


CTTCTGTTTTTTTCTC 


CAG 


PTM+SF-Py 


:As in PTM+SF 


TACTAAC 


CTTCTGTATTATTCTC 


CAG 


PTM+SF-Py 


:As in PTM+SF 


TACTAAC 


GTTCTGTCCTTGTCTC 


CAG 


PTM+SF-BP3 


:As in PTM+SF 


TGCTGAC 


CTTCTGTTTmTCTC 


CAG 


PTM+SFBP3-Py 


:As in PTM+SF 


TGCTGAC 


CTTCTGTATTATTCTC 


CAG 



1 5 Nucleotides in bold indicate the mutations compared to normal BP, PPT and 3' splice site. 

Branch site A is underlined. The nucleotides in italics indicates the mismatch introduced 
into safety BD to mask the BP sequence in the PTM. 

A double-/raw-splicing PTM construct (DS-PTM) was also made adding 
a 5' splice site and a second target binding domain complementary to the second intron of 
20 PHCG pre-mRNA to the 3' end of the toxin coding sequence of PTM+SF (Figure A). 

6.1.2, pHCG6 TARGET PRE-mRNA 
To produce the in vitro target pre-mRNA, a Sad fragment of PHCG gene 
6 (accession #X00266) was cloned into pBS(-). This produced an 805 bp insert from 
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nucleotide 460 to 1265, which includes the 5' untranslated region, initiation codon, 
exon 1, intron 1, exon 2, and most of intron 2. For in vivo studies, an EcoRI and BamHI 
fragment was cloned into mammalian expression vector (pc3.1DNA), producing PHCG6. 

6.1.3. mRNA PREPARATION 
5 For in vitro splicing experiments, PHCG6, p-globin pre-mRNA and 

different PTM mRNAs were synthesized by in vitro transcription of BamHI and Hindlll 
digested plasmid DNAs respectively, using T7 mRNA polymerase (Pasman & Garcia- 
Blanco, 1996, Nucleic Acids Res. 24:1638). Synthesized mRNAs were purified by 
electrophoresis on a denaturing polyacrylamide gel, and the products were excised and 
10 eluted. 

6.1.4 TM VITRO SPLICING 
PTMs and target pre-mRNA were annealed by heating at 98°C followed 
by slow cooling to 30-34°C. Each reaction contained 4 ^il of annealed mRNA complex 
(100 ng of target and 200 ng of PTM), IX splice buffer (2 mM MgClj, 1 mM ATP, 5 

1 5 mM creatinine phosphate, and 40 mM KCI) and 4 ^l of HeLa splice nuclear extract 

(Promega) in a 12.5 jil final volume. Reactions were incubated at 30*C for the indicated 
times and stopped by the addition of an equal volume of high salt buffer (7 M urea, 5% 
SDS, 100 mM LiCl, 10 mM EDTA and 10 mM TrisHCI, pH 7.5). Nucleic acids were 
purified by extraction with phenol:chloroform:isoamyl alcohol (50:49: 1) followed by 

20 ethanol precipitation. 
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6.1.5. REVERSE TRANSCRIPTION-PCR REACTIONS 
RT-PCR analysis was perfonned using EZ-RT PGR kit (Perkin-Elmer, 
Foster City, CA). Each reaction contained 10 ng of cis- or rrora-spliced mRNA, or 
1-2 Jig of total mRNA, 0.1 \i\ of each 3' and 5' specific primer, 0.3 mM of each dNTP, IX 
EZ buffer (50 mM bicine, 1 15 mM potassium acetate, 4% glycerol, pH 8.2), 2.5 raM 
magnesium acetate and 5 U of xTth DNA polymerase in a 50 nl reaction volume. 
Reverse transcription was perfonned at 60°C for 45 min followed by PGR amplification 
of the resulting cDNA as follows: one cycle of initial denaturation at 94°C for 30 sec, and 
25 cycles of denaturation at 94X for 18 sec and annealing and extension at 60°C for 
40 sec, followed by a 7 min final extension at 70°C. Reaction products were separated by 
electrophoresis in agarose gels. 

Primers used in the study were as follows: 
DT-IF: GGCGCTGCAGGGCGCTGATGATGTTGTTG 
DT-2R:GGCGAAGCTTGGATCCGACACGATTTCCTGCACAGG 
DT-3R: CATCGTCATAATTTCCTTGTG 
DT-4R: ATGGAATCTACATAACCAGG 
DT-5R: GAAGGCTGAGCACTACACGC 
HCG-R2: CGGCACCGTGGCCGAAGTGG, 
Bio-HCG-F:ACCGGAATTCATGAAGCCAGGTACACCAGG 

p-globulin-F: GGGCAAGGTGAACGTGGATG 
p-globulin-R: ATCAGGAGTGGACAGATCC 
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6.1.6. CELL GROWTH. TRANSFECTION AND mRNA ISOLATION 
Human lung cancer cell line H1299 (ATCC accession # CRL-5803) was 
grown in RPMI medium supplemented with 10% fetal bovine serum at 3TC in a 5% CO2 
environment. Cells were transfected with pcSp+CRM (CRM is a non-functional toxin), a 
5 vector expressing a PTM, or vector alone (pcDNA3. 1) using lipofectamine reagent (Life 
Technologies, Gaithersburg, MD). The assay was scored for neomycin resistance (neo^ 
colony formation two weeks after transfection. Four neo' colonies were selected and 
expanded under continued neo selection. Total cellular mRNA was isolated using RNA 
exol (BioChain Institute, Inc., San Leandro, CA) and used for RT-PCR. 



10 6.1.7. TTg^iVy-SPLICING IN TUMORS IN NUDE MICE 

Eleven nude mice were bilaterally injected (except BIO, Bl 1 and B12 had 
1 tumor) into the dorsal flank subcutaneous space with 1 x 10^ HI 299 human lung tumor 
cells (day 1). On day 14, the mice were given an appropriate dose of anesthesia and 
injected with, or without electroporation (T820, BTX Inc., San Diego, CA) in several 

1 5 orientations with a total volume of 1 00 fA of saline containing 1 00 pcSp+CRM with 
or without pcpHCG6 or pcPTM+Sp. Solutions injected into the right side tumors also 
contained India ink to mark needle tracks. The animals were sacrificed 48 hours later and 
the tumor excised and immediately frozen at -80 "C. For analysis, 10 mg of each tumor 
was homogenized and mRNA was isolated using a Dynabeads mRNA direct kit (Dynal) 

20 followmg the manufacturers directions. Purified mRNA (2 ;zl of 10 pel total volume) was 
subjected to RT-PCR using PHCG-F and DT-5R primers as described earlier. All 
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samples were re-amplified using DT-3R, a nested DT-A primer and biotinylated pHCG-F 
and the products were analyzed by electrophoresis on a 2% agarose gel. Samples that 
produced a band were processed into single stranded DNA using M280 Streptavidin 
Dynabeads and sequenced using a toxin specific primer (DT-3R). 

5 6.2. RESULTS 

6.2.1. SYNTHESIS OF PTM 
A prototypical rran^-splicing mRNA molecule, pcPTM+Sp (Figure 1 A) 
was constructed that included: an 18 nt target bmding domain (complementary to PHCG6 
intron 1), a 30 nucleotide spacer region, branch point (BP) sequence, a polypyrimidine 

10 tract (PPT) and an AG dinucleotide at the 3* splice site immediately upstream of an exon 
encoding diphtheria toxin subunit A (DT-A) (Uchida et al , 1973, J. BioL Chem. 
248:3838). Later DT-A exons were modified to eliminate translation initiation sites at 
codon 14. The PTM constructs were designed for maximal activity in order to 
demonstrate fran^-splicing; therefore, they included potent 3' splice elements (yeast BP 

15 and a mammalian PPT) (Moore et aL, 1993, In The mRNA World, R.F. Gesteland and 
J.F. Atkins, eds. (Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press). 
pHCG6 pre-mRNA (Talmadge et al, 1984, Nucleic Acids Res. 12:8415) was chosen as a 
model target as this gene is expressed in most tumor cells. It is not expressed in normal 
adult cells, with the exception of some in the pituitary gland and gonads. (Acevedo et al, 

20 1992, Cancer 76: 1467; Hoon et al, 1 996, Int J. Cancer 69:369; Bellet et a/., 1997, Cancer 
Res. 57:516). As shown in Figure IC, pcPTM+Sp forms conventional Watson-Crick 
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base pairs by its binding domain with the 3' end of pHCG6 intron 1, masking the intronic 
3' splice signals of the target. This feature is designed to facilitate /raw5-splicing between 
the target and the PTM. 

HeLa nuclear extracts were used in conjunction with established splicing 
procedures (Pasman & Garcia-Blanco, 1996, Nucleic Acids Res. 24:1638) to test if a 
PTM construct could invade the pHCG6 pre-mRNA target. The products of in vitro 
frawiy-splicing were detected by RT-PCR, using primers specific for chimeric mRNA 
molecules. The predicted product of a successful /raw-splicing reaction is a chimeric 
mRNA comprising the first exon of pHCG6, followed immediately by the exon 
contributed from pcPTM+Sp encoding DT-A (Figure IC). Such chimeric mRNAs were 
readily detected by RT-PCR using primers pHCG-F (specific to pHCG6 exon 1) and DT- 
3R (specific to DT-A, Figure 2A, lanes 1-2). At time zero or in the absence of ATP, no 
466 bp product was observed, indicating that this reaction was both ATP and time 
dependent. 

The target binding domain of pcPTM+Sp contained 18 nucleotides 
complementary to pHCG6 intron 1 pre-mRNA and demonstrated efficient /rara-splicing 
(Figure 2A, lanes 1-2). Tran^-splicing efficiency decreased at least 8 fold (Figure 2, 
lanes 3-4) using non-targeted PTM-Sp, which contains a non-complementary 18 
nucleotide '^non-binding domain". TraK^-splicing efficiencies of PTM mRNAs with or 
without a spacer between the binding domain and BP were also compared. This 
experiment demonstrated a significant increase in the efficiency of trans-splicing by the 
addition of a spacer (Figure 2B, lanes 2 + 5). To facilitate the recruitment of splicing 
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factors required for efficient ^ra/w-splicing, some space may be needed between the 3' 
splice site and the double-stranded secondary structure produced by the binding 
domain/target interaction. 

To investigate the effect of PTM length on tram-spUcing specificity, 

5 shorter PTMs were synthesized from AccI cut PTM plasmid (see Figure 1). This 

eliminated 479 nt from the 3* end of the DT-A coding sequence. Figure 2B shows the 
trans-splicing ability of a targeted short PTM(+) (lanes 10-12), compared to a non- 
targeted short PTM(-) (lanes 14-17). Short PTM+ produced substantially more trans- 
spliced product (Figure 2B, lane 12) than its counterpart, non-targeted short PTM (Figure 

10 2B, lane 1 7). These experiments indicate that longer PTMs may have increased potential 
to mediate trans-splicing non-specifically. 



6.2.2. ACCURACY OF PTM SPLICEOSOME MEDIATED 
r/MAK-SPLICING 

To confirm that trans-splicing between the pcPTM+Sp and pHCG6 target 

1 5 is precise, RT-PCR amplified product was produced using 5' biotinylated pHCG-F and 

nonbiotinylated DT-3R primers. This product was converted into single stranded DNA 

and sequenced directly with primer DT-3R (DT-A specific reverse primer) using the 

method of Mitchell and Merril (1989, Anal. Biochem. 178:239). Traw^-splicing occurred 

exactly between the predicted splice sites (Figure 3), confirmii^ that a conventional pre- 

20 mRNA can be invaded by an engineered PTM construct during splicing; moreover, this 

reaction is precise. 
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In addition selective rraw^-splicing of a double splicing PTM (DS-PTM) 
was observed (Figure 8B). The DS-PTM can produce /r^ns-splicing by contributing 
either a 3' or 5' splice site. Further, DS-PTMs can be constructed which will be capable 
of simultaneously double-fran^-splicing, at both a 3' and 5* site, thereby permittmg exon 
5 replacement. Figure 8B demonstrates that in this construct the 5* splice site is most active 
at a 1 :1 concentration of target PHCG pre-mRNA:DS-PTM. At a 1 :6 ratio the 3* splice 
site is more active. 



6.2.3. 3' SPLICE SITES ARE ESSENTIAL FOR PTM TRANS-SPUCING 
In general, the 3* splice site contains three elements: 1) a BP sequence 

1 0 located 5* of the acceptor site, 2) a PPT consisting of a short run of pyrimidine residues, 
and 3) a YAG trinucleotide splice site acceptor at the intron-exon border (Senapathy et 
al, 1990, Cell 91 :875; Moore et aL, 1993). Deletion or alteration of one of tiiese 
sequence elements are known to either decrease or abolish spHcing (Aebi et al, 1986; 
Reed & Maniatis 1988, Genes Dev. 2:1268; Reed, 1989, Genes Dev. 3:21 13; Roscigno et 

15 al, 1993, J. Biol. Chem. 268:11222; Coolidge et al, 1997, Nucleic Acids Res. 25:888). 
The role of these conserved elements in targeted /ran^-splicing was addressed 
experimentally. In one case [(BP(-)Py(-)AG(-)], all three cis elements (BP, PPT and 
AG dinucleotide) were replaced by random sequences. A second splicing mutant [(Py(- 
)AG(-)] was constructed in which the PPT and the 3* splice site acceptor were mutated 

20 and substituted by random sequences. Neither construct was able to support trans- 

splicing in vitro (Figure 2A, lanes 5-8), suggesting that, as in tiie case of conventional cis- 
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splicing, the PTM trans-spUcmg process also requires a functional BP, PPT and AG 
acceptor at the 3' splice site. 

6.2.4, DEVELOPMENT OF A "SAFETY" SPLICE SITE 
TO INCREASE SPECIFICITY 

5 To improve the levels of target specificity achieved by the inclusion of a 

binding domain or by shortening the PTM, the target-binding domain of several PTM 
constructs was modified to create an intra-molecular stem to mask the 3* splice site 
(termed a "safety PTM"). The safety stem is formed by portions of the bindmg domain 
that partially base pair with regions of the PTM 3' splice site or sequences adjacent to 

10 them, thereby blocking the access of spliceosomal components to the PTM 3* splice site 
prior to target acquisition (Figure 4A, PTM+SF). Base pairing between fi-ee portions of 
the PTM binding domain and pHCG6 target region unwinds the safety stem, allowing 
spUcing factors such as U2AF to bind to the PTM 3' splice site and mitiate trans-splicing 
(Figure 4B). 

1 5 This concept was tested in splicing reactions containing either PTM+SF 

(safety) or pcPTM+Sp (linear), and both target (pHCG6) and non-target (P-globin) pre- 
mRNA. The spliced products were subsequently analyzed by RT-PCR and gel 
electrophoresis. Using pHCG-F and DT-3R primers, the specific 196 bp /ra/w-spliced 
band was demonstrated in reactions containing pHCG target and either linear PTM 

20 (pcPTM+Sp, Figure 5, lane 2) or safety PTM (PTM+SF, Figure 5, lane 8). Comparison 
of the targeted rr^/n^-splicing between linear PTM (Figure 5, lane 2) and safety PTM 
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(Figure 5, lane 8) demonstrated that the safety PTM rron^-spliced less efficiently than the 
linear PTM. 

Non-targeted reactions were amplified using P-globin-F (specific to 
exon 1 of p-globin) and DT-3R primers. The predicted product generated by non-specific 

5 PTM /ran5-splicing with p-globin pre-mRNA is 1 89 bp. Non-specific rrans-splicing was 
evident between Imear PTM and p-globin pre-mRNA (Figme 5, lane 5). In contrast, non- 
specific /r^iny-splicing was virtually eliminated by the use of safety PTM (Figure 5, 
lane 11). This was not unexpected, since the linear PTM was designed for maximal 
activity to prove the concept of spliceosome-mediated tra/iy-splicing. The open structure 

10 of the linear PTM combined with its potent 3* splice sites strongly promotes the binding 
of splicing factors. Once bound, these splicing factors can potentially initiate trans- 
splicing with any 5* splice site, in a process similar to /raiiy-splicing in trypanosomes. 
The safety stem was designed to prevent splicing factors, such as U2AF from binding to 
the PTM prior to target acquisition. This result is consistent with a model that base- 

1 5 pairing between the free portion of the binding domain and the pHCG6 target unwinds 
the safety stem (by mRNA-mRNA interaction), uncovering the 3' splice site, permitting 
the recruitment of splicing factors and initiation of /raw-splicing. No /raw-splicing was 
detected between p-globin and pHCG6 pre-mRNAs (Figure 5, lanes 3, 6, 9 and 12). 



6.2.5. IN VITRO TRANS-SPUCmG OF SAFETY PTM AND VARIANTS 
20 To better understand the role of cz5:-elements at the 3* splice site in trans- 

splicing a series of safety PTM variants were constructed in which either the PPT was 
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weakened by substitution with purines and/or the BP was modified by base substitution 
(see Table I). In vitro /ran^-splicmg efficiency of the safety (PTM+SF) was compared to 
three safety variants, which demonstrated a decreased ability to trans-splico. The greatest 
effect was observed with variant 2 (PTM+SFPy2), which was rran^-splicing incompetent 

5 (Figure 4C, lanes 5-6). This mhibition of /raw^-splicing may be attributed to a weakened 
PPT and/or the higher of the safety stem. In contrast, variations in the BP sequence 
(PTM+SFBP3) did not markedly effect /ran^-splicing (Figure 4C, lanes 7-8). This was 
not surprising smce the modifications introduced were within the mammalian branch 
point consensus range YNYURAC (where Y = pyrimidine, R = purine and N = any 

10 nucleotide) (Moore et al, 1993). This finding indicates that the branch point sequence 
can be removed without affecting spUcing efficiency. Alterations in the PPT (PTM+SF- 
Pyl) decreased tiie level of /ram-splicmg (lanes 3-4). Similarly, when both BP and PPT 
were altered PTM+SFBP3-Pyl, they caxised a fiirther reduction in /raw^-splicing 
(Figure 4C, lanes 9-10). The order of /ran^-splicing efficiency of these safety variants is 

1 5 PTM+SF>PTM+SFBP3> PTM+SFPyl>PTM+SFBP3-Pyl>PTM+SFPy2. These results 
confirm that both the PPT and BP are important for efficient in vitro trans-spWcmg 
(Roscigno et aL, 1993, L Biol. Chem. 268:1 1222). 

6.2.6. COMPETITION BETWEEN CIS- AND TRANS- SPLICING 

To determine if it was possible to block pre-mRNA czs-splicing by 
20 increasing concentrations of PTM, experiments were performed to drive the reaction 
towards trans-splicmg. Splicing reactions were conducted with a constant amount of 
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pHCG6 pre-mRNA target and various concentrations of /raw5-splicing PTM. Cis- 
splicing was monitored by RT-PCR using primers to pHCG-F (exon 1) and pHCG-R2 
(exon 2). This amplified the expected 125 bp m-spliced and 478 bp unspliced products 
(Figure 6A). The primers pHCG-F and DT-3R were used to detect trans-spliced products 

5 (Figure 6B). At lower concentrations of PTM, c/5-splicing (Fig. 6A, lanes M) 
predominated over /ran^-splicing (Figure 6B, lanes 1-4). Cf5-splicing was reduced 
approximately by 50% at a PTM concentration 1 .5 fold greater than target. Increasing the 
PTM mRNA concentration to 3 fold that of target inhibited cis-splicing by more than 
90% (Figure 6A, lanes 7-9), with a concomitant increase in the /ra«5'-spliced product 

1 0 (Figure 6B, lanes 6-10). A competitive RT-PCR was performed to simultaneously 
amplify both cis and frany-spliced products by including all three primers (pHCG-F, 
HCG-R2 and DT-3R) in a single reaction. This experiment had similar results to those 
seen in Figure 6, demonstrating that under in vitro conditions, a PTM can effectively 
block target pre-mRNA c/^-splicing and replace it with the production of an engineered 

1 5 trans-spliced chimeric mRNA. 

6.2.7. TRANS-SFLICING IN TISSUE CULTURE 
To demonstrate the mechanism of rraw5-splicing in a cell culture model, 
the human lung cancer line HI 299 (pHCG6 positive) was transfected with a vector 
expressing SP+CRM (a non-functional diphtheria toxin) or vector alone (pcDNA3.1) and 
20 grown in the presence of neomycin. Four neomycin resistant colonies were individually 
collected after 14 days and expanded in the continued presence of neomycin. Total 
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mRNA was isolated from each clone and analyzed by RT-PCR using primers PHCG-F 
and DT-3R. This yielded the predicted 196 bp rran^-spliced product in three out of the 
four selected clones (Figure 7A, lanes 2, 3 and 4). The amplified product from clone #2 
was directly sequenced, confirming that PTM driven /raw^-splicing occurred in human 
5 cells exactly at the predicted splice sites of endogenously expressed PHCG6 target exon 1 
and the first nucleotide of DT-A (Figure 7B). 

6.2.8. TTg^ATS^-SPLICING IN AN JN VIVO MODEL 
To demonstrate the mechanism of ^aw5-splicing in vivo, the following 
experiment was conducted in athymic (nude) mice. Tumors were established by injecting 

10 10^ HI 299 cells into the dorsal flank subcutaneous space. On day 14, PTM expression 
plasmids were injected into tumors. Most tumors were then subjected to electroporation 
to facilitate plasmid delivery (see Table 2, below). After 48 hrs, tumors were removed, 
poly-A mRNA was isolated and amplified by RT-PCR. Traw^-splicing was detected in 8 
out of 19 PTM treated tumors. Two samples produced the predicted rra«:s--spliced 

1 5 product (466 bp) from mENA after one round of RT-PCR. Six additional tumors were 
subsequently positive for fran^-splicing by a second PCR amplification using a nested set 
of primers that produced the predicted 196 bp product (Table 2). Each positive sample 
was sequenced, demonstrating that pHCG6 exon 1 was precisely trans-spMctd to the 
coding sequence of DT-A (wild type or CRM mutant) at the predicted splice sites. Six of 

20 the positive samples were from treatment groups that received cotransfected plasmids, 
pcPTM+CRM and pcHCG6, which increased the concentration of target pre-mRNA. 
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This was done to enhance the probability of detecting /ra«5-spliced events. The other 
two positive tumors were from a group that received only pcPTM+Sp (wild type DT-A). 
These tumors were not transfected with pHCG6 expression plasmid, demonstrating once 
again, as in the tissue culture model described in Section 6.2.7, that /ra/zs-splicing 



5 occurred between the PTM and endogenous pHCG6 pre-mRNA produced by tumor cells. 





Tabic 2. Trans-splicing in tumors in nude mice. 


Mouse 


Plasmid 


Left Right 


Elcctroporation 


RT-PCR 
Left Right 


Nested PGR 


Nucleotide Sequence 


'Bl 


nPMV-^nnrt 


Bl-l Bl-2 










B2 


pCMV-Sport 


Bl-3 Bl-4 


nooov/cm 


- 


- 


- 


B3 


pcSp+CRM 


B3-1 B3-2 


MOOOV/cra 












B3-3 B3-4 


MOOOV/cm 








B4 


pcSp+CRM 


B4-1 B4-2 


•^SOV/cm 












B4-3 B4.4 


'^25V/cm 








B5 


pcSp+CRM/ 
pcHCG6 


B5-1 B5-2 


MOOOV/cm 


+ 


+ + 


ATGTTCCAGi GGCGTGATGAT 
(SEQ IDNO:53) 






B5-3 B5-4 


•lOOOV/cm 


+ 




ATGTTCC AG 1 GGCGTGATGAT 
(SEQ IDNO:53) 


B6 


pcSp+CRM/ 
pcHCG6 


B6-1 B6-2 














B6-3 B6-4 


'25V/cm 






ATGTTCCAG 1 GGCGTGATGAT 
(SEQIDNO:53) 


B7 


pcPTM+Sp 


87-1 


MOOOV/cm 








B8 


pc PTM+Sp 


BS-l 


•^SOV/cm 




+ 


ATGTTCCAG i GGCGTGATGAT 
(SEQIDNO:53) 


■B9 


pcPTM+Sp 


B9-1 






+ 


ATGTTCCAG J GGCGTGATGAT 
(SEQIDNO:53) 



6 pulses of 99^s sets of 3 pulses administered orthogonally 
^: 8 pulses of IQms sets of 4 pulses administered orthogonally 
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8 pulses of 50ms sets of 4 pulses administered orthogonally 
: positive for RT-PCR trans-spliced produce 
: did not receive electroporation 



7. EXAMPLE: lacZ TRANS-S,VUCmG MODEL 
In order to demonstrate and evaluate the generality of the mechanism of 
spliceosome mediated targeted franj-splicing between a specific pre-mRNA target and a 
PTM, a simple model system based on expression of en^me (J-galactosidase was 
5 developed. The following section describes results demonstrating successful splicesome 
mediated targeted rra^w-splicing between a specific target and a PTM. 

7.1. MATERIALS AND METHODS 
7.1.1. PRIMER SEQUENCES 
The following primers were used for testing the lacZ model system: 

10 5' Lac-IF GCATGAATTCGGTACCATGGGGGGGTTCTCATCATCATC 

5' Lac-IR CTGAGGATCCTCTTACCTGTAAACGCCCATACTGAC 

3' Lac-IF GCATGGTAACCCTGCAGGGCGGCTTCGTCTGGGACTGG 

3' Lac-IR CTGAAAGCTTGTTAACTTATTATTTTTGACACCAGACC 

3' Lac-Stop GCATGGTAACCCTGCAGGGCGGCTTCGTCTAATAATGGGACTGGGTG 

15 HCG-InlF GCATGGATCCTCCGGAGGGCCCCTGGGCACCTTCCAC 

HCG-InlR CTGACTGCAGGGTAACCGGACAAGGACACTGCTTCACC 

HCG-Ex2F GCATGGTAACCCTGCAGGGGCTGCTGCTGTTGCTG 

HCG-Ex2R CTGAAAGCTTGTTAACCAGCTCACCATGGTGGGGCAG 
Lac-TRl (Biotin): 7-GGCTTTCGCTACCTGGAGAGAC 

20 Lac-TR2 GCTGGATGCGGCGTGCGGTCG 

HCG-R2: CGGCACCGTGGCCGAAGTGG 



7. 1.2. CONSTRUCTION OF THE lacZ PRE-mRNA TARGET MOLECULE 
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The lacZ target 1 pre-mRNA (pc3.1 lacTl) was constructed by cloning of 
the following three PGR products: (i) the 5' fragment of lacZ; followed by (ii) pHCG6 
intron 1; (iii) and the 3' fragment of lacZ. The 5' and 3' fragment of the lacZ gene were 
PGR amplified from template pcDNA3.1/HisAacZ (Invitrogen,San Diego, GA) using the 

5 following primers: 5* Lac-IF and 5'Lac-lR (for 5' fragment), and 3'Lac-lF and 3' Lac-IR 
(for 3* fragment). The amplified lacZ 5' fragment is 1788 bp long which includes the 
mitiation codon, and the amplified 3' fragment is 1385 bp long and has the natural 5' and 
3' splice sites in addition to a branch point, polypyrimidine tract and PHCG6 intron 1. 
The pHGG6 intron 1 was PGR amplified using the following primers: HCG-InlF and 

10 HCG-InlR. 

The lacZ target 2 is an identical version of lacZ target 1 except it contains 
two stop codons (TAA TAA) in frame four codons after the 3 'splice site. This was 
created by PGR amplification of the 3' firagment (lacZ) using the following primers: 3* 
Lac-Stop and 3' Lac IR and replacing the functional 3' Augment in lacZ target 1. 



15 7.1.3. CONSTRUGTION OF pc3.1 PTMl and pc3.1 PTM2 

The pre-trans-splicing molecule, pc3.1 PTMl was created by digesting 
pPTM +Sp with PstI and Hindlll and replacing the DNA fragment encoding the DT-A 
toxin with die a DNA firagment encoding the fimctional 3' end of lacZ. This fragment 
was generated by PGR amplification using the following primers: 3' Lac- IF and 3' Lac- 

20 IR. For cell culture experiments, an EcoRI and Hindlll firagment of pc3.1 PTM2 which 
contains the binding domain to HGG intron 1, a 30 bp spacer, a yeast branch point 
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(TACTAAC), and strong polypyrimidine tract followed by the lacZ cloned was cloned 
into pcDNAS.l. 

The pre-trans-splicing molecule, pc3.1 PTM2 was created by digesting 
pPTM +Sp with PstI and Hindlll and replacing the DNA fragment encoding the DT-A 
5 toxin with the PHCG6 exon 2. pHCG6 exon 2 was generated by PGR amplification 
using the following primers: HCG-Ex2F and HCG-Ex2R. For cell culture experiments, 
an EcoRI and Hindlll fragment of pc3.1 PTM2 which contains the binding domain to 
HCG intron 1 , a 30 bp spacer, a yeast branch point (TACTAAC), and strong 
polypyrimidine tract followed by the PHCG6 exon 2 cloned was used. 

10 7.1 .4. CO-TRANSFECTION OF THE lacZ SPLICE TARGET 

PRE-mRNA AND PTMS INTO 293T CELLS 

Human embryonic kidney cells (293T) were grown in DMEM medium 

supplemented with 10% FBS at 37°C in a 5% CO2. Cells were co-transfected with pc3.1 

LacTl and pc3.1 PTM2, or pc3.1 LacT2 and pc3.1 PTMl, using Lipofectamine Plus 

1 5 (Life Technologies,Gaithersburg, MD) according to the manufacturer's instructions. 24 

hours post-transfection, the cells were harvested; total RNA was isolated and RT-PCR 

was performed using specific primers for the target and PTM molecules, p-galactosidase 

activity was also monitored by staining the cells using a p-gal staining kit (Invitrogen, 

San Deigo. OA). 

20 

7.2. RESULTS 
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7.2.L THE lacZ SPLICE TARGET C/S-SPLICES EFFICIENTLY TO 
PRODUCE FUNCTIONAL p-GALACTOSIDASE 

To test the ability of the splice target pre-mRNA to c/5-splice efficiently, 

5 pc3.1 lacTX was transfected into 293 T cells using Lipfectamine Plus reagent (Life 

Technologies,Gaithersburg, MD) followed by RT-PCR analysis of total RNA. Sequence 

analysis of the c/5-spliced RT-PCR product indicated that splicing was accurate and 

occurred exactly at the predicted splice sites (Fig. 12B). In addition, accurate cis-splicing 

of the target pre-mRNA molecule results in formation of a mRNA capable of encoding 

10 active p-galactosidase which catalyzes the hydrolysis of p-galactosidase, /. e. , X-gal, 

producing a blue color that can be visualized under a microscope. Accurate c/5-splicing of 

the target pre-mRNA was further confirmed by successfully detecting P-galactosidase 

enzyme activity. 

Repair of defective lacZ target 2 pre-mRNA by trans-splicing of the 
15 functional 3* lacZ fragment (PTMl) was measured by staining for P-galactosidase 
enzyme activity. For this purpose, 293T cells were co-transfected with lacZ target 2 
pre-mRNA (containing a defective 3* fragment) and PTMl (contain normal 3' lacZ 
sequence), 48 hours post-transfection cells were assayed for P-galactosidase enzyme 
activity. Efficient trans-splicing of PTMl into the lacZ target 2 pre-mRNA will result in 
20 the production of functional P-galactosidase activity. As demonstrated in Figure 1 IB-E, 
trans-splicing of PTM 1 into lacZ target 2 results in restoration of p-galactosidase enzyme 
activity up to 5% to 10% compared to control. 
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7.2.2. TARGETED riMATS-SPLICING BETWEEN 
THE lacZ TARGET PRE-mRNA and PTM2 

To assay for /raw^-splicing, lacZ target pre-mRNA and PTM2 were 

transfected into 293 T cells. Following transfection, total RNA was analyzed xising RT- 

5 PGR. The following primers were used in die PGR reactions: /acZ-TRl (lacZ 5* exon 
specific) and HGGR2 (pHCGR exon 2 specific). The RT PGR reaction produced the 
expected 195 bp fraws-spliced product ( Fig. 11, lanes 2 and 3) demonstrating efficient 
rrans-splicing between the lacZ target pre-mRNA and PTM 2. Lane 1 represents the 
control, which does not contain PTM 2. 

10 The efficiency of the ^ran^-splicing was also measured by staining for P- 

galactosidase enzyme activity. To assay for trans-splicing, 293T cells were co- 
transfected with lacZ target pre-mRNA and PTM 2. 24 hours post-transfection, cells 
were assayed for p-galactosidase activity. If there is efficient traw^-splicing between the 
target pre-mRNA and the PTM, a chimeric mRNA is produced consisting of the 5' 

1 5 fragment of the lacZ target pre-mRNA and pHCG6 exon 2 is formed which is incapable 
of coding for an active P-galactosidase. Results from the co-transfection experiments 
demonstrated that rra/iy-splicing of PTM2 into lacZ target 1 resulted in the reduction of 
p-galactosidase activity by compared to the control. 

To fijTther confirm that /ra/w-splicing between the lacZ target pre-mRNA 

20 and PTM2 is accurate, RT-PCR was performed usmg 5' biotinylated lacZ-TRl and 
non-biotinylated HGGR2 primers. Single stranded DNA was isolated and sequenced 
directly using HGGR2 primer (HGG exon 2 specific primer). As evidenced by the 
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sequence of the splice junction, /raw5-splicing occurred exactly as predicted between the 
splice sites (Fig. 12A and 12B), confirming that a conventional pre-mRNA can be 
invaded by an engineered PTM during splicing, and moreover, that this reaction is 
precise. 

8. EXAMPLE: CORRECTION OF THE CYSTIC FIBROSIS 
TRANSMEMBRANE REGUALTOR GENE 

Cystic fibrosis (CF) is one of the most common genetic diseases in the 

world. The gene associated with CF has been isolated and its protein product deduced 

(Kerem, B.S. etal., 1989, Science 245:1073-1080; Riordan et al., 1989, Science 

245:1066-1073;Rommans, et al., 1989, Science 245:1059-1065). The protein product of 

the CF associated gene is referred to as the cystic fibrosis trans-membrane conductance 

regulator (CFTR). The most common disease-causing mutation which accounts for 

"-70% of all mutant alleles is a deletion of three nucleotides in exon 10 that encode for a 

phenylalanine at position 508 (AF508). The following section describes the successfiil 

repair of the cystic fibrosis gene using spliceosome mediated trans-splicing and 

demonstrates the feasibility of repairing CFTR in a model system. 

8.1 MATERIALS AND METHODS 
8.1.1. PRE-TRANS-SPLICING MOLECULE 
The CFTR pre-trans-splicing molecule (PTM) consists of a 23 nucleotide 
bmding domain complimentary to CFTR intron 9 (3* end, -13 to -31), a 30 nucleotide 
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spacer region (to allow efficient binding of spliceosomal components), branch point (BP) 
sequence, polypyrimidine tract (PPT) and an AG dinucleotide at the 3' splice site 
immediately upstream of the sequence encoding CFTR exon 10 (wild type sequence 
containing F508). This initial PTM was designed for maximal activity in order to 

5 demonstrate /ran^-splicing; therefore the PTM included a UACUAAC yeast consensus 
BP sequence and an extensive PPT. An 18 nucleotide HIS tag (6 histamine codons) was 
included after wild type exon 10 coding sequence to allow specific amplification and 
isolation of the trans-spliced products and not the endogenous CFTR. The 
oligonucleotides used to generate the two fragments included imique restriction sites. 

10 (Apal and PstI, and PstI and NotI, respectively) to facilitate directed cloning of amplified 
DNA into the mammalian expression vector pcDNA3 J . 



8.1.2. THE TARGET CFTR PRE-mRNA MINI-GENE 
The CFTR mini-gene target is shown in Figure 1 3 and consists of CFTR 
exon 9 ; the functional 5' and 3* regions of intron 9 (260 and 265 nucleotides from each 
15 end, respectively); exon 10 [AF508]; and the 5' region of intron 10 (96 nucleotides). In 
addition, as depicted in Figure 16, a mini-target gene comprising CFTR exons 1-9 and 
10-24 can be used to test the use of spliceosome mediated trans-splicing for correction of 
the cystic fibrosis mutation. Figure 17, shows a double splicing PTM that may also be 
used for correction of the cystic fibrosis mutation. As shown, the double splicing PTM 
20 contains CFTR BD intron 9, a spacer, a branch point, a polypyrimidine tract, a 3' splice 
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site, CFTR exon 10, a spacer, a branch point, a polypyrimidine tract, a 5' splice site and 
CFTRBDexon 10. 

8.1.3. OLIGONUCLEOTIDES 
The following oligonucleotides were used to create CFTR PTM: 

S Forward CF3 

ACCT GGGCCC ACC CAT TAT TAG GTC ATT AT CCC/CGG AAC ATT ATA 
Apal site. Intron 9 CFTR, - 1 2 to -34. 

Reverse CF4 

ACCT CTGCAG G7(gy4CC CTG CAG GAA AAA AAA GAA G 
10 Pstl. BstEI. PPT. 

Forward CFS 

ACCT CTGCAG ACT TCA CTT CTA ATG ATG AT 
Pstl. Exon 10 CFTR, +1 to +24 

Reverse CF6 

15 ACCT GCGGCCGC CTA ATG ATG ATG ATG ATG ATG CTC TTC TAG TTG GCA TGC 

Not I. Stop Polyhistamine tag Exon 10 CFTR, +15 to +132 



The following nucleotides were used to create the CFTR TARGET pre-mRNA 
mini gene (Exon 9 + mini-Intron 9 + Exon 10 + 5' end Intron 10): 

Forward CF18 

20 GACCT CTCGAGGGATTTGGGGAATTATITGAG 

Xhol Exon 9 CFTR, 1 to 21. 

Reverse CF19 

CTGACCT GCGGCCGC TAC AGT GTT GAA TGT GGT GC 
Notl. Intron 9 5' end. 

25 Forward CF20 

CTGACCT GCGGCCGC CCA ACT ATC TG A ATC ATG TG 
Notl. Intron 9 3' end. 

Reverse CF21 

GACCT CTTAAG TAG ACT AAC CGA TTG AAT ATG 
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Aflll Intron 10 5* end. 

The following oligonucleotides were used for detection of trans-spliced products: 

Reverse Bio-His 

CTA ATG ATG ATG ATG ATG ATG 
5 Stop. Polyhistidine tag (5' biotin label). 

Reverse Bio-Hisf2) 

CGC CTA ATG ATG ATG ATG ATG 

3' UT Stop. Polyhistidine tag (5' biotin label). 

Forward CF8 
10 CTT CTT GGT ACT CCT GTC CTG 

Exon 9 CFTR. 

Forward CF18 

GACCT CTCGAG GGA TTT GGG GAA TTA TTT GAG 
Xhol. Exon 9 CFTR. 

IS Reverse CF28 

A AC TAG AAG GCA CAG TCG AGG 

Pc3.1 vector sequence (present in PTM 3' UT but not target). 

8.2. RESULTS 
The PTM and target pre-mRNA were co-transfected in 293 embryonic 

20 kidney cells using lipofectamine (Life Technologies,Gaithersburg, MD). Cells were 

harvested 24 h post transfection and RNA was isolated. Using PTM and target-specific 
primers in RT-PCR reactions, a /raw5-spliced product was detected in which mutant exon 
10 of the target pre-mRNA was replaced by the wild type exon 10 of the PTM 
(Figure 14). Sequence analysis of the trans-spliced product confirmed the restoration of 

25 the three nucleotide deletion and that splicing was accurate, occurring at the predicted 

splice sites (Figure 1 5), demonstrating for the first time RNA repair of the cystic fibrosis 
gene, CFTR (Mansfield et al., 2000, Gene Therapy 7:1885-1895). 
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9. EXAMPLE: DOUBLE-riMA/y-SPLICING 
The following example demonstrates accurate replacement of an internal 
exon by a double-Zraw^-splicing between a target pre-mRNA and a PTM RNA containing 
both 3* and 5' splice sites leading to production of full length functionally active protein. 

As described herein, any pre-mRNA can be reprogrammed by providing a 
/rdfW5-reactive RNA molecule containing either a 3'-splice site, a 5 -splice site or both. 
The following example describes successful targeting and replacement of a single internal 
exon utilizing pre-rraw5-splicing molecules (PTMs) containing both the 5* and 3* splice 
sites. Such PTMs can promote two /raw5-splicing reactions with the intended target gene 
mediated by the splicesome(s). To test this mechanism, a splicing iacZ model target gene 
consisting of lacZ 5' "exon" - CFTR mini-intron 9 - CFTR exon 10 (AF508) - CFTR 
mini-intron 10 followed by lacZ 3* "exon" was created. In this target transcript, a 124 bp 
central portion of the P-galactosidase ORF was substituted by exon 10 (AF508) of CFTR, 
thus it produces non-functional protein. A PTM consisting of the missing 124 bp lacZ 
"mini-exon" and a 5' and 3' /rara-splicing domain containing binding domains (BDs) 
complementary to the target introns and exons was created. Transfection of HEK 293T 
cells with either target alone or PTM alone showed no detectable levels of p-gal activity. 
In contrast, 293T cells transfected with target plus PTM produced substantial levels of p- 
gal activity indicating the restoration of protein function. The accuracy of trans-splicmg 
between the target and PTM was confirmed by sequencing the appropriate RT-PCR 
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product, which revealed the predicted internal exon substitution. The feasibility of this 
approach in a disease model was tested by replacing the CFTR AF508 exon 10 with 
normal exon 10 containing F508 in cystic fibrosis. These results demonstrate that a 
rran^-splicing technology can he easily adapted to correct many of the genetic defects 
whether they are associated with the 5' exon or 3* exon or any internal exon of the gene. 

Figure 18 is a schematic of a model lacZ target consisting of lacZ 5' exon - 
CFTR mini-intron 9 - CFTR exon 10 (delta 508) - CFTR min-intron 10 followed by the 
lacZ 3' exon. In this target, a 124 bp central portion of the lacZ gene is substituted with 
CFTR exon 10 which has a mutation at position 508 (delta 508). The pre-mRNA target 
undergoes normal cz5-splicing to produce an mRNA consisting of lacZ 5' exon - CFTR 
exon 10 (delta 508) followed by the lacZ 3' exon. Because of the disruption in 
p-galactosidase ORF it producies truncated proteins which are non-functional. 

To restore P-gal fimction by double-/ra?ir-splicing, three PTMs were 
created consisting of the missing 124 bp lacZ "mini-exon" and a 5' and 3* tranS'Splicing 
domain containing binding domains complementary to the target introns and exons as 
shown in Figure 19. These PTMs have an 120 bp 3' binding domain (complementary to 
intron 9) from PTM24 (see below) used in 3* exon replacement, spacer sequence, yeast 
branch point, polypyrimidine tract, 3* acceptor AG dinucleotide, lacZ "mini-exon", 5' 
splice site, spacer sequence followed by the 5* binding domain. These PTMs differ only 
in their 5' binding domain sequences. DSPTM5 has a 27 bp BD which is complementary 
to intron 10 and blocks just the 5' splice site of the target. DSPTM6 has 120 bp 5' BD 
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and covers both 5' and 3' splice sites of the target, while, DSPTM7 has 260 bp BD which 
masks both the splice sites (5' and 3') and also covers the entire exon of the target 

A schematic representation of a double-/ra«s-splicing reaction showing 
the binding of DSPTM7 with DSCFT1.6 target pre-mKNA is shown in Figure 20. 3* BD: 
5 1 20 bp binding domain complementary to mini-intron 9; 5' BD (260 bp); second binding 
domain complementary to mini-intron 10 and exon 10. ss: splice sites; BP: branch point, 
and PPT: polypyrimidine tract. 

The important structural elements of DSPTM7 (Figure 21) are as follows: 

(n3'BDfl20BP> : GATTCACTTGCTCCAATTATCATCCTAAGCAGAAGTGTAT 
10 ATTCTTATTTGTAAAGATTCTATTAACTCATTTGATTCAA 

AATATTTAAAATACTTCCTGTTTCATACTCTGCTATGCAC 

(2) Spacer sequences (24 hp) : AACATTATTATAACGTTGCTCGAA 

(3) Branch point pvrimidine tract and acceptor splice site : 

3'ss 

15 BP Kpnl PPT EcoRV 1 lacZ mini-exon 

TACTAAC T GGTACC TCTTCTTTTTTTTTT GATATC CTGCAG | GGCGGC | 

(4") 5' donor site and 2"* spacer sequence : 

5' ss 

lacZ mini-^xon 1 

20 | TGAACG | GTAAGT GTTATCACCGATATGTGTCTAACCTGATTCGGGCCTTC 
GATACGCTAAGATCCACCGG 

(5) 5' BD (260 BPV TCAAAAAGTTTTCACATAATTTCTTACCTCTTCTTGAATT 
CATGCTTTGATGACGCTTCTGTATCTATATTCATCATTGG 
AAACACCAATGATTTTTCTTTAATGGTGCCTGGCATAATC 
25 CTGGAAAACTGATAACACAATGAAATTCTTCCACTGTGC 

TTAAAAAAACCCTCTTGAATTCTCCATTTCTCCCATAATC 
ATCATTACAACTGAACTCTGGAAATAAAACCCATCATTA 
TTAACTCATTATCAAATCACGC 
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To determine whether the restoration of p-gal function is RN A trans- 
splicing mediated, the mutants are depicted in Figure 22. DSPTM8 is a 3' splice mutant 
in which the 3' splice elements such as BP, polypyrimidine tract and the 3' acceptor 
AG dinucleotides were deleted and replaced with random sequences. This PTM still has 
3* and 5' binding domains and the functional 5' splice site. PTM29 lacks the 2"** binding 
domain + 5' ss but still has the 3* binding domain 3' splice site, while PTM30 lacks the 
1^ binding domain + 3' splice site but has the functional 5* splice site and 2"** binding 
domain. 

To examine the double-Zraw^-splicing mediated restoration of p-gal 
function, 293T cells were either transfected with 2 of target or PTM alone or 
co-transfected with 2 //g of target -i- 1 .5 /zg of PTM using Lipofectamine Plus reagent. 
48 hrs. after transfection, total RNA was isolated and analyzed by RT-PCR using Kl-IF 
and Lac-6R primers. These primers amplify both cis- and /rflK5-spliced products in a 
single reaction which were identified based on the size. The cf^-spliced product is 295 bp 
in size while the frara-spliced product is 230 bp in size. To confirm that /ran^-splicing 
between DSPTM7 and DSCFT1.6 pre-mRNA is precise, RT-PCR amplified products 
were excised, re-amplified using K1-2F and Lac-6R primers and sequenced directly using 
K1-2F or Lac-6R primers. As shown in Figure 23 rrow-splicing occurred exactly at the 
predicted splice sites, confirming the precise internal exon substitution by two 
tram-s^Yxcmg events. 

The repair of defective lacZ pre-mRNA by double rraw^-splicing events 
and subsequent production of full-length p-gal protein was investigated in co-transfection 
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assays. 293T cells were co-transfected with DSCFT1.6 target and DSPTM7 expression 
plasmids, as well as with DSCFTl ,6 target or DSPTM7 alone as controls. Western blot 
analysis of total cell ly sates using polyclonal anti-P-galactosidase antiserum specifically 
recognized a - 120 kDa protein only in cells co-transfected with DSCFTl. 6 target + 

5 DSPTM7 plasmids (Fig. 24, lanes 3 and 4) but not in cells transfected with either 

DSCFTl. 6 target (Lane 1) or DSPTM7 plasmid alone (Lane 2). Similarly, no full-length 
protein was detected in cells co-transfected with DSCFTl .6 target + 3* splice mutant 
(Lane 5 and 6) or PTM29 or 30 in which either 3' /raw-splicing domain or 5' 
/rans-splicing domains has been deleted (Lane 7). In addition, the 120 kDa protein band 

10 co-migrated with the full-length functional p-gal produced using lacZ-Tl plasmid 

(positive control, data not shown). These results not only confirmed the production of 
full-length protein by double-/raw5-splicing between the target and PTM but also 
demonstrated that both the 3* splice site and 5' splice sites are essential for this process. 

To determine whether the full-length protein produced by double-Zran^- 

1 5 splicing between the target pre-mRNA and DSPTM7 RNA is functionally active, 293T 
cells were co-transfected with DSCFTl. 6 targeted + one of the double splicing PTMs 5, 6 
or 7 expression plasmids, or transfected with DSCFTl. 6 target or DSPTM7 alone. Total 
cell extracts were prepared and assayed for (5-gal activity using ONPG assay (Invitrogen). 
P-gal activity in extracts prepared from cells transfected with either DSCFTl. 6 target or 

20 DSPTM7 alone was abnost identical to the background levels detected in mock 

transfection (Fig. 25). In contrast, 293T cells co-transfected with DSCFTl .6 target and 
DSPTM7 produced - 21 fold higher levels of p-gal activity over the background 
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(Fig. 25). These results confirmed the accurate double-/raw5-splicmg between the target 
pre-mRNA and PTM RNA and production of the full-length functional protein. 

To confirm that restoration of p-gal activity by double-/ra«5-splicing 
reaction is absolutely depended on the presence of both 3' and 5* splice sites of the PTM, 

5 we constructed several mutants: (a) DSPTM8, is identical to DSPTM7 except the 

functional 3* spice elements (branch point, polypyrimidine tract and the 3' acceptor AG 
dinucleotides) were deleted and substituted with random sequences (see Fig. 22 for 
details); (b) PTM29 lacks 5' splice site as well as the 5' binding domain but has the 
3* binding domain and 3* splice site, and (c) PTM30 lacks 3' binding domain and 3' splice 

1 0 site but has the 5* splice site and 5' binding domain, p-gal activity m extracts prepared 
from cells transfected with either DSCFTl .6 target or DSPTM7 alone was almost 
identical to the background levels detected in mock transfection (Fig. 26). Similarly, no 
significant increase in P-gal activity was detected in cells transfected with either 
DSPTM8 alone (3* splice site mutant) or co-transfection of DSCFTl. 6 target + one of the 

1 5 above mutant PTMs. On the other hand, cells co-transfected with DSCFTl .6 target and 
DSPTM7 with fimctional 3* and 5' splice sites produced substantial levels of p-gal 
activity over the background (Fig. 26). These results confirmed the requirement of both 
splice sites in the double-splicing PTM and also eliminated the possibility that restoration 
of P-gal activity was due to complementation between the truncated proteins (Fig. 26). 

20 Different concentrations of the target and PTM were co-transfected and 

analyzed for p-gal activity restoration. As expected, 293T cells co-transfected with 
DSCFTl. 6 target + DSPTM7 showed substantial levels of p-gal activity (- 30 fold) over 

NY02:302146 1 "77- 



3f^B-A-B 072874.0135 



the controls. Increasing the concentrations of the PTM by 2 and 3 fold did increase the 
level of p-gal activity, but not significantly (Fig. 27). These results further confirmed the 
double-Zrcm-splicing mediated restoration of p-gal enzyme function. 

The specificity of double-/rflM^-splicing reaction was examined by 

5 constructing a non-specific target (DSHCGTl .1) which is similar to that of specific target 
(DSCFT1.6) but has pHCG intron 1 - pHCG exon 2 and pHCG intron 2 instead of CFTR 
mini-intron 9 - CFTR exon 10 (delta 508) and CFTR mini-intron 10 (Fig. 28). RT-PCR 
analysis of the total RNA isolated firom cells transfected with either DSHCGTl. 1 (non- 
specific target) alone or m combination DSPTM7 (targeted to DSCFT1.6 target) failed to 

1 0 produce the expected 3 1 4 bp double-Zran^-spliced product. On the other hand, RT-PCR 
analysis of the total RNA prepared from cells co-transfected with specific target PTM 
produced the expected 314 pb product. This was further confirmed by p-gal activity 
assay of the total cellular extract. The level P-gal activity detected in cells transfected 
with non-specific target alone or in combination with DSPTM7 targeted to DSCFT1.6 

1 5 target was almost identical to the background level. In contrast substantial levels of p-gal 
activity was detected in cells co-transfected with specific target (DSCFT1.6) + DSPTM7 
(Fig. 27). These results confirmed that the double-/ra«5-splicing is highly specific. 

The repair model in Fig. 30 shows a portion of a target CFTR pre-mRNA 
consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation, mini- 

20 intron 10 and exons 1 1-24 (Fig. 30). The PTM shown in the figure consists of exon 10 
coding sequences (containing codon 508) and two /ra/zy-splicing domains each with its 
own splicing elements (acceptor and donor sites, branchpoint and pyrimidine tract) and a 
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binding domain complementary to intron 9 splice site, part of exon 10 (5' and 3' ends) and 
intron 10 5* splice site (Fig. 31 (DS-CFl)). Exon 10 of the PTM also has modified codon 
usage throughout to reduce antisense effects between exon 10 of the PTM and it's own 
binding domains and for PTMs that have binding domains which are complementary to 
exon sequences (Fig. 31). A doubie-Zrara-splicing event between the PTM and target 
should produce a repaired full-length mRNA. 

Fig. 32 shows the sequence of a single PGR product shoving target exon 9 
correctly spliced to PTM 20 exon 10 (with modified codons) (upper panel), codon 508 in 
exon 10 of the PTM (middle panel) and PTM exon 10 correctly spliced to target exon 1 1 
(lower panel). The sequence of a repaired target was generated by RT-PCR followed by 
PGR. 

10. EXAMPLE: 77MJV5-SPLIGING REPAIR OF THE 

GYSTIG FIBROSIS GENE USING A PTM 

THAT CAN PERFORM 5' EXON REPLACEMENT 

The key advantage of using 5' exon replacement for gene repair are 

(a) it permits replacement of the 5' portion of a gene 

(b) the construct requires less sequence and space than a full-length gene construct, 

(c) PTMs can be produced that lack a polyA signal which should prevent PTM 
translation, and (d) the 5' end can be modified to increase translation. 
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10.1 MATERIALS AND METHODS 
10.1.1 PLASMID CONSTRUCTION 
The CFTR coding sequences (exons 1-10) for PTM30 were generated by 
PCR using a partial cDNA plasmid template (61 160; American Type Culture Collection, 

5 Manassas, VA). The trans-splicmg domain (TSD) [including the binding domain, spacer 
sequence, polypyrimidine tract (PPT), branchpoint (BP) and 3* splice site] was generated 
from a PCR product (using an existing plasmid template) and by annealing 
oligonucleotides. The different fragments (the TSD and coding sequences) were then 
cloned into pcDNA3.1(-) using appropriate restriction sites. Oligodeoxynucleotide 

10 primers were procured from Sigma Genosys (The Woodlands, TX). All PCR products 
were generated with either REDra^ (Sigma, St. Louis, MO), or cloned PJu (Stratagene, 
La JoUa, CA) DNA Polymerase. PCR primers for amplification contained restriction 
sites for directed cloning. PCR products were digested with the appropriate restriction 
enzymes and cloned into the mammalian expression plasmid pc3.1DNA(-) (Invitrogen, 

15 Carlsbad, CA). 



10.2 CELL CULTURE AND TRANSFECTIONS 
Constructs were cotransfected in human embryonic kidney (HEK) 293T or 
293 ceils (1.25 x 10^ cells per 60 mm poly-d-lysine coated dish) using LipofectaminePlus 
(Life Technologies, Gaithersburg, MD) and the cells were harvested 48 h after the start of 
20 transfection. Total RNA was isolated as described in the manufacturers instructions 
(Epicenter Technologies, Inc.). HEK 293T cells were grown in Dulbecco*s Modified 
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Eagle's Medium (Life Technologies) supplemented with 10% v/v fetal bovine serum 
(Hyclone, Inc., Logan, UT). All cells were kept in a humidified incubator at 37''C and 
5% CO2. 

10.1.3 REVERSE TRANSCRIPTION-PQLYMERASE CHAIN 
5 REACTION (TR-PCR) 

RT-PCR was performed using an EZ-RT-PCR kit (Perkin-Ehner, Foster, 

CA). Each reaction contained 0.03 to 1 .0 ptg of total RNA and 80 ng of a 5' and 3* 

specific primer in a 40 jul reaction volume. RT-PCR products were electrophoresed on 

2% Seaken agarose gels. The PTM- and target-specific oligonucleotides used to generate 

10 fraw5-spliced products are 5'-CGCTGGAAAAACGAGCTTGTTG-3' (primer CF93) and 
5'-ACTCAGTGTGATTCCACCTTCTC-3' (primer CFl 1 1), respectively. The PTM- and 
target-specific oligonucleotides used to generate cz5-spliced products were CFl and 
CF93. The sequence of oligonucleotide CFl is 
5'-GACCTCTGCAGACTTCACTTCTAATGATGATTATGG-3'. 

15 The repair model in Fig. 33 shows a portion of a target CFTR pre-mRNA 

consisting of exons 1-9, mini-intron 9, exon 10 containing the delta 508 mutation, mini- 
intron 10 and exons 1 1-24 (Fig. 33). The PTM shown in the figure consists of exon 1-10 
coding sequences (containing codon 508) and a /rcfrt^-splicing domain with its own 
splicing elements (donor site, branchpoint and pyrimidine tract) and a binding domain. 

20 Several PTMs have been constructed with different binding domains. Three examples 
are shown in Figure 34. In Fig. 34A the bmding domain is complementary to the splice 
site of intron 9 and part of exon 10 (3' end; CF-PTM 1 1). In Fig. 34B the PTM has an 



NYO2 302146 1 



-81- 



072874.0135 



extended binding domain which also covers the 5* end of exon 10 and the 3* splice site of 
intron 9 (CF-PTM 20). In the last example (Fig. 34C) the bindmg domain is the same as 
that shown in panel B except the binding domain extends the full-length of exon 10 (CF- 
PTM 30), In the latter case the PTM exon 10 has modified codon usage to reduce 

5 antisense effects with it's own binding domain (Fig. 34). Further examples of binding 
domains are shown in Figure 35. 

Figure 36 shows the sequence of cis- and ^an^-spliced products. The top 
panel of Fig. 36A shows target exon 10 with it's three missing nucleotides (CTT), whilst 
the lower panel shows exon 10 and 1 1 of the target correctly spliced together. 

1 0 Figure 36B is a partial sequence of a single PGR product showing the modified codons in 
exon 10 of the PTM (upper panel), codon 508 in exon 10 of the PTM (middle panel), and 
PTM exon 10 correctly spliced to target exon 1 1 (lower panel), indicating that /raw- 
splicing is accurate. The sequence of the repaired target was generated by RT-PCR 
followed by PGR. 



15 11. EXAMPLE: PTMs WITH A LONG BINDING 

DOMAIN, WmGH MAY BE DISGONTIN- 
UOUS, HAVE INCREASED TRANS'S?UCmG 
EFFICIENCY AND SPECIFICITY 

11.1. MATERIALS AND METHODS 

20 11.1.1. CELL CULTURE 

Human embryonic kidney cells (293 or 293T) were iSrom the University of 

North Carolina tissue culture facility at Chapel Hill (Chapel Hill, NC). Cells were 
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maintained at 37°C in a humidified incubator with 5% CO2 in Dulbecco's modified 
Eagle's medium (Life Technologies, Bethesda, MD) supplemented with 10% v/v fetal 
bovine serum (Hyclone, Logan, UT). Cells were passaged every 2-3 days using 0.5% 
trypsin and re-plated at the desired density. Stable cells, expressing an endogenous 
5 mutant lacZ pre-mRNA (lacZCF9) were maintained in the presence of 0.5 mg/ml G4 1 8 
(Calbiochem, San Diego, CA). 

11. L2. RECOMBINANT PLASMDS 
Targets: pc3.11acZCF9, pc3.11acZCF9m, and pc3.11acZHCGlm. 
pc3.11acZCF9 encodes for a normal /acZ pre-mRNA was constructed using /acZ codmg 

10 sequences nucleotides 1-1788 as 5* exon, CFTR mini-intron 9 followed by tocZ coding 
sequences nucleotides 1789-3174 as 3' exon. This is similar to pc3.11acZ-T2 construct 
but without stop codons in the lacZ 3' exon and has CFTR mini-intron 9 instead of 
pHCG6 intron 1 (Fig. 37A). CFTR mini-intron 9 was PCR amplified using plasmid T5 
as template and primers CFIN-9F (S'-CTAGOATCCCGTTCTTTTGTTCTTCACT 

1 5 ATTAA) and CFIN-9R f5'-CTAG GGTTACC GAAGTAAAACCATACTTATTAG, 
restriction sites imderlined), digested with BamHl and BstE II and cloned in place of 
BHCG6 intron 1 of pc3.11acZ-T2 plasmid. pc3.11acZCF9m expresses a defective lacZ 
pre-mRNA and is identical to pc3.11acZCF9 but contains two in-firame non-sense codons 
in the 3' exon (Fig. 37A). pc3.11acZHCGlm is a chimeric target, which includes the lacZ 

20 5* exon followed by intron 1 and exon 2 of pHCG6. This is similar to pc3. llacZCF9m 
except that it contains exon 2 of pHCG6 in place of mutant lacZ 3' exon. pHCGr6 exon 2 
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was PGR amplified using pHCG6 plasmid (accession # X00266) as template DNA and 
primers HCGEx-2F rS'-GCAT GGTTACC CTGCAGGGGCTGCTGCTGTTGCTG) and 
HCGEX.2R (5»-CTGAAAGOTGTTAACCAGCTCACCATGGTGGGGCAG, 
restriction sites underlined) digested with BstE II and Hind III and cloned in place of the 

5 lacZy exon of pc3.11acZCF9m. Plasmid pcDNA3.l/HisB//acZ (Invitrogen, Carlsbad, 
CA) was used as DNA template to produce 5' and 3' lacZ exons. The lacZ 5* exon is 
1788 bp long, has an ATG initiation codon, lacZ 3' exon (without stop codons) is 1385 bp 
long and has a transcription termination signal at the end of the 3 ' exon. CFTR 
mini-intron 9 and pHCG6 intron 1 are 548 bp and 352 bp in size, respectively, and both 

10 have 5* and 3' splice signals. Exon 2 of pHCG6 is 162 bp long and has a transcription 
termination signal at the end of the exon. 

Pre-trans-splicing Molecules (PTMs): PTM-CF14 is an identical version 
of pcPTMl with minor modifications in the /raw5-splicing domain (Fig. 37B). 
PTM-CF14 is a linear version and contains a 23 bp antisense binding domain (BD) 

1 5 (5*-ACCCATCATTATTAGGTCATTAT) complementaiy to CFTR mini-intron 9, 1 8 bp 
spacer, a canonical branch point sequence (UACUAAC; BP) and an extended 
polypyrimidine tract (PPT) followed by normal lacZ 3' exon. PTM-CF22, PTM-CF24, 
PTM-CF26 and PTM-CF27 are identical to PTM-CF14 except they differ in length of the 
BD (Fig. 37B). sPTM-CF18 has a 32 bp BD, sPTM-CF22 and sPTM-CF24 contain die 

20 same BD as PTM-CF22 and PTM-CF24, respectively. In these PTMs, the binding 

domains were modified to create mtra-molecular stem-loop structure ("safety") to mask 
the 3* splice-site of the PTM. Different bmding domains were produced by PGR 
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amplification using specific primers (with unique Nhe I and Sac II sites) and a plasmid 
containing CFTR mini-intron 9 as template. PGR products were digested with Nhe I and 
Sac II and cloned into a PTM plasmid consisting of spacer sequences, 3' splice elements 
(BP, PPT and acceptor AG dinucleotide) followed by a normal lacZ 3* exon. 

1L1.3. TRANSFECTION OF PLASMID DNAs INTO 293T CELLS 
The day before transfection, 1x10^ 293T cells were plated on 60 mm 
plates coated with Poly-D-lysine (Sigma, St. Louis, MO) to enhance the adherence of 
cells and grown for 24 hr at 37 °C. Cells were transfected with expression plasmids using 
LipofectaminePlus reagent according to standard protocols (Life Technologies, Bethesda, 
MD). In a typical co-transfection, 2 /zg of pc3.11acZCF9m target and 1.5 of PTM 
expression plasmids were transfected into cells and for controls (target and PTM alone 
transfections) total DNA concentration was normalized to 3.5 /ig with pcDNA3.1 vector. 

Forty-eight hours after transfection the plates were rinsed with PBS, cells 
harvested and total RNA or DNA was isolated using MasterPure RNA/DNA purification 
kit (Epicenter Technologies, Madison, WI). Contaminating DNA in the RNA preparation 
was removed by treating with DNase I, while, contaminating RNA in the DNA 
preparation was removed by digesting with RNase A at 37°C for 30-45 min. 
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1 1 .1.4. REVERSE TRANSCRIPTION-POLYMERASE 
CHAIN REACTION fRT-PCR) 

RT-PCR was perfonned as suggested by manufacturer using an EZ rTth 

RNA PCR kit (Perkins-Elmer, Foster City, CA). A typical reaction (50 fxY) contained 

25-500 ng of total RNA, 100 ng of 5' target specific primer (common to cis- and 

rra«5-spliced products) (Lac-9F, 5'-GATCAAATCTGTCGATCCTTCC) and 100 ng of 

3' primer (Lac-3R, 5'-CTGATCCACCCAGTCCCATTA, target specific primer for 

cf^-splicing, and Lac-5R, 5'-GACTGATCCACCCAGTCCCAGA, PTM specific primer 

for transspMcmg), IX reverse transcription buffer (100 mM Tris-HCl, pH 8.3, 900 mM 

KCL with 1 mM MnCl2), 200 ^uM dNTPs and 10 units of rTth DNA polymerase. 

RT reactions were performed at eO^'C for 45 min. followed by 30 sec pre-heating at 94 ''C 

and 25-35 cycles of PCR amplification at 94°C for 18 sec, annealing and extension at 

60°C for 1 min followed by a final extension at 70°C for 7 min. The reaction products 

were analyzed by agarose gel electrophoresis. 

11.1.5. PROTEIN PREPARATION AND p-GAL ASSAY 
Total cellular protein fi-om cells transfected with expression plasmids was 
isolated by fireeze thaw method and assayed for p-galactosidase activity using a p-gal 
assay kit (Invitrogen, Carlsbad, CA). Protem concentration was measured by the 
dye-bmding assay using Bio-Rad protein assay reagents (BIO-RAD, Hercules, CA). 
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11.1.6. WESTERN BLOT 
About 5-25 //g of total protein was electrophoresed on a 7.5% SDS-PAGE 
gel and electroblotted onto PVDF-P membrane (Millipore), After blocking for 1 hr at 
room temperature (blocking buffer: 5% dry milk and 0.1% Tween-20 in IX PBS), the 
blot was incubated with a 1 :2500 dilution of polyclonal rabbit anti-p-galactosidase 
antibody for 1 hr at room temperature (Research Diagnostics hic. NJ), washed 3x with 
blocking buffer and then incubated with a 1:5000 diluted anti-rabbit HRP conjugated 
secondary antibody. After incubating at room temperature for 1 hr, it was washed 3x in 
blocking buffer and developed usmg ECLPlus Western blotting reagents (Amersham 
Pharmacia Biotech, Piscataway, NJ). 

11.1.7. INSITUQ-GAL STAINING 
Cells were monitored for the expression of functional p-galactosidase 
using a p-gal staining kit (Invitrogen, Carlsbad, CA), The percentage of p-gal positive 
cells were determined by coimting stained vs. unstained cells in 5-10 randomly selected 
fields. 

11.1.8. SELECTION OF NEOMYCIN RESISTANT CLONES 
EXPRESSING AN ENDOGENOUS DEFECTIVE lacZ 
PRE-mRNA TARGET 

On day 1, 1 X 10^ 293 cells were plated on 60 mm plates and grown for 

20 24 hr at 37°C. On day 2, the cells were transfected with 2 /zg of pc3.11acZCF9m using 

LipofectaminePlus transfection reagent as described above. 48 hr post-transfection, cells 
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were split (1 :20 ratio) and grown in media containing 0.5 mg/ml G418. At the end of 
2 weeks, neomycin resistant colonies were selected, pooled, expanded and maintained 
constantly in the presence of G418. 

11.2. RESULTS 

5 A model system was developed that permits facile and versatile analysis of 

spliceosome mediated RNA /raw^-splicing in cells. The bacterial lacZ gene was split 
with a truncated intron 9 from the Cystic Fibrosis Transmembrane Conductance 
Regulator (CFTR) gene (Figure 37A). This split lacZ gene, when introduced into human 
293T cells, directed the synthesis of a lacZ pre-mRNA diat could splice properly. The 

1 0 open reading frame of the lacZ gene was mutated by insertion of two in-frame nonsense 
codons near the 5' end of the second exon (Figure 37A). This /acZgene is referred to as 
lac2CF9m. In 293T cells, lacZCF9m directs the synthesis of lacZCF9m pre-mRNA, 
which encodes a truncated P-galactosidase (p-gal) protem that does not have enzymatic 
activity. Cells bearing the lacZCF9m gene are a model system for genetic disorders 

15 caused by loss of function mutations. 

Pre-/r^/«5-splicing molecules (PTMs) were designed to rron^-splice with 
lacZCF9m pre-mRNA and repair the mutation caused by the two nonsense codons. 
PTMs were constructed with binding domains spanning 23, 91 and 153 nucleotides (nt), 
which we named PTM-CF14, PTM-CF22 and PTM-CF24 (Figure 37B). The PTM-CF24 

20 bindii^ domain does not bind 1 53 contiguous nt in the targeted CFTR gene intron 9, but 
rather creates a loop of 47 nt in the target in between two regions of complementary of 27 
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and 126 nt (Figure 37B). These PTMs were predicted to repair the deficiency created by 
lacZCF9m (Figure 37C). 

Semi-quantitative RT-PCR analysis was used to tests the efficiency of 
/rcin^-splicing mediated by PTMs with long target binding domains. Repair of lacZCF9m 

5 transcripts by rran^-splicing was tested in two different ways: co-transfection of PTM and 
target (lacZCF9m) plasmids or transfection of cells that had been modified to express the 
target as an endogenous pre-mRN A. Co-transfecting plasmids encoduig PTMs with the 
lacZCF9m plasmid provided a facile method for screening the former for efficiency. 
PTM-CF22 and PTM-CF24 were approximately 3-fold and 1 0-fold more efficient than 

1 0 PTM-CF 1 4 in a semi-quantitative RT-PCR assay suggesting a significant improvement m 
mRNA repair (Figure 38). Sequencing of the RT-PCR products showed that trans- 
splicing was accurate, resulting in proper ligation of the exons from the target and the 
PTM. Moreover, mutation of key ci^-acting elements in the 3' splice site of the PTMs 
resulted in an abrogation of rran^-splicing. In these and all other assays described herein 

15 controls were carried out to rule out recombination at the DNA level. Thus, repair of the 
lacZCF9m transcripts was a result of targeted RNA /ran^-splicing. 

Transfection of PTM-CF14, -CF22 or -CF24 into 293 cells bearing an 
endogenous lacZCF9m gene confirmed that the longer target binding domains provided 
the PTMs with higher efficiency (Figure 38B). It should be noted that similar levels of 

20 RT-PCR /ra«5-splicing specific product were obtained after 30 PGR cycles and 35 cycles 
for PTM-CF24 and PTM-CF14, respectively. The data therefore suggests that PTMs 
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with long binding domains repaired lacZCF9m transcripts at least an order of magnitude 
better than previously described PTMs. 

More than one in ten transcripts of lacZCF9m can be repaired by trans- 
splicing. Quantitative, real-time PGR was used to measure the fraction of lacZCF9m 

5 transcripts repaired by PTMs with long binding domains. The co-transfection assay 
described above was used in these experiments. PTM-CF14, which contains a binding 
domain of 23 nt, was shown to repair between 1,2 and 1.6% of lacZCF9m RNAs in 293T 
cells and 2.1% of lacZCF9m RNAs in the H1299 human lung cancer cells. PTM-CF24, 
which has a 153 not long binding domain, was significantly more efficient, correcting 

10 between 12.1 and 15.2% of lacZCF9m RNAs in 293T cells and 19.7% m H1299 cells. 
This in effect resulted in a measurable reduction in the levels of lacZCF9m mRNA. 
These data also confirmed the remarkable capability of this RT-PCR assay to distinguish 
between the products of c/^-splicing, the lacZCF9m and mRNA, and the products of 
frflw^-spUcing, repaired lacZCF9m mRNA. This is the first true quantification of the 

15 efficacy of rraw-splicing mediated mRNA repair at the RNA level. These data confirm 
the suggestions of the semi-quantitative RT-PCR analysis shown above. Similar 
experiments were carried out using 293 cells that express an endogenous lacZCF9m pre- 
mRNA target Consistent with the data shown above, PTM-CF24 was ten times more 
efficient than PTM-CF 14, with the former correcting between 1 .3 and 4. 1% of 

20 endogenous lacZCF9m transcripts. These data confirmed that increasing the length of the 
PTMs provided a remarkable enhancement in trans-splicing efficiency. 
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rra^-splicing mediated mRNA repair results in the synthesis of active 
p-galactosidase. At the cellular level, the ultimate criterion for the success of mRNA 
repak is the production of an active protem. Using a western assay it was determined that 
fiilHength p-gal was produced as a result of trans-splicing . Full-length p-gal was not 

5 observed following transfection of 293T cells with plasmids encoding lacZCF9m or 

PTM-CF24. Co-transfection of both plasmids, however, resulted in robust production of 
full-length p-gal protein, which was readily detectable using anti-P-gal antiserum 
(Figure 39). This result complements enzymatic activity data suggests that the latter was 
not due to a complementation by truncated p-gal proteins. The Westem blot analysis 

10 revealed that full-length p-gal protein was made in 293T cells by trans-splicing and 

furthermore confirmed that the PTMs with long binding domains were efficiently spliced. 

Appropriate repair of P-gal mRNA and synthesis of full-length p-gal 
protein should lead to the production of active enzyme. Indeed, 293T cells co-transfected 
with lacZCF9m and PTM-CF24 were shown to have p-gal activity measured either in situ 

1 5 (Figure 40A) or in extracts (Figure 40B). This activity was shown to depend on the 

rran^-splicing between the target pre-mRNA and the PTM. The quantitative in solution 
assay further confirmed the data presented above: PTM-CF22 and PTM-CF24 were 2.9 
and 9.3 fold more efficient respectively than PTM-CF14. Most unpressive, however, 
were results using 293 cells that harbor lacZCF9m as a stable endogenous gene. When 

20 these cells were transfected with PTM-CF 1 4 the levels of p-gal activity obtained were 
barely above background. Transfection with PTM-CF24, however, resulted in a 
considerable level of p-gal activity (Figure 40C). This was paralleled by the appearance 
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of full-length P-gal protein. These data demonstrate a sizeable increase in the efficiency 
of rran^-splicing to repair a mutated pre-mRNA. In fact all prior reports of repair of 
endogenous RNA in manmialian cells by either group I ribozymes or /ra/iy-splicing have 
been only documented using RT-PCR, an indication of the low level of repair. 
5 PTMs with very long binding domains are highly specific. It was shown 

that a secondary structure within the binding domain could enhance specificity of PTMs 
in HeLa nuclear extracts. In order to ascertain the specificity of the rran^-splicing 
reactions in vivo a second target gene was prepared, which could serve as reporter of non- 
specific reactions. This gene, which is referred to as lacZHCGlm, shares the first exon 

10 with lacZCF9m. The intron in lacZHCGlm is intron 1 of the P-subunit of the human 

chorionic gonadotropin gene 6 (PhCG6) and the second exon is exon 2 of the same gene. 
lacZHCGl m drives the synthesis of a pre-mRNA that is spliced correctly to yield a 
chimeric mRNA that does not encode a full-length P-gal (see below). PTM-CF14, 'CF22 
and -CF24 are not targeted to lacZHCGlm pre-mRNA since there is no complementarity 

1 5 between the binding domains in these PTMs and the target gene. Any /raw^-splicing 

between these PTMs and lacZHCGlm pre-mRNA is therefore non-specific (Figure 41 A). 

293T cells were transfected with PTM-CF14, -CF22 or -CF24 and the 
level of non-specific trans-splicing was determined by RT-PCR and by in solution p-gal 
assays. Semi-quantitative RT-PCR suggested that PTM-CF24 was significantly less 

20 likely than PTM-CF14 to irans-splict with lacZHCGlm pre-mRNA. Measurement of p- 
gal activity confirmed this; cells co-transfected with lacZHCGlm and PTM-CF24 
produced 3.7 fold less p-gal than those co-transfected with lacZHCGlm and PTM-CF14 
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(Figure 41C). Based on these data it was estimated that PTM-CF24 is 50 times more 
likely to /ra«5-splice to its target than to a non-specific target. A "safety" version of 
PTM-CF24, SPTM-CF24, did not confer further specificity (Figure 41 C). Nonetheless, 
for PTMs with shorter binding domains a "safety" stem involving the binding domain 
5 was seen to improve specificity in vivo (Figure 41 C). It was concluded from these data 
that the longer binding domains resulted in PTMs that were not only more efficient but 

also more specific. 

The observation that long binding domains increased the specificity of 
PTMs suggested that very long binding domains (>200 nt) could fiirther enhance 

10 discrimination. Plasmids encoding PTM-CF26 and -CF27, which have binding domains 
that span 200 nt and 41 1 nt respectively, were constructed and co-transfected with 
lacZHCGlm plasmid. Non-specific frans-splicing of these two PTMs was barely 
detectable with RT-PCR (Figure 41B). As measured by the p-gal assay PTM-CF26 and - 
CF27 had minimal non-specific /row-splicing activity (Figure 41C). In a specific trans- 

1 5 splicing reaction with lacZCF9m as measured by the solution p-gal assay PTM-CF26 was 
as active as PTM-CF14 (Figure 41B). It was estimated that PTM-CF26 is 80 times more 
likely to /ra«5-splice to the specific target (lacZCF9m) than to a non-specific target 
(lacZHCGlm). Therefore, inclusion of very long binding domains confers to these PTMs 
very high specificity. 

20 The present invention is not to be limited in scope by the specific 

embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in tiie art from the 
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foregoing description and accompanying Figures. Such modifications are intended to fall 
within the scope of the appended claims. Various references are cited herein, the 
disclosure of which are mcorporated by reference in their entireties. 
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